Hacker Newsnew | past | comments | ask | show | jobs | submitlogin

It's a good point for text input, but if you go multi-modal and somehow find a way to make good use of audio and video, there's practically unlimited data available.

Also considering that humans will probably still only publish the output that looks good, even that still provides a weak signal on quality.



I'm skeptical that multimodal input can help with programming or logic problems - or even most scientific problems.


Having diagrams (think free body diagrams in static mechanics, or a T-s diagram in thermodynamics) make a lot of non-trivial problems a lot simpler to communicate. And correctly understanding an unambiguous definition of a problem is a major step towards solving it.

If language was enough (or a similar idea, that multimodal input is not useful), college math professors wouldn't use so much chalk making drawings and diagrams to explain their ideas.




Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: