
I want to explore audio encoding and GPT-like understanding of audio. I'm very interested in how a simple 1D signal has to go through so much processing before language models can understand it, and I'm curious what tradeoffs are involved. It would also be fun to build a TTS library and understand how it works.



