https://news.ycombinator.com/item?id=17808349
First edition: http://www.uokufa.edu.iq/staff/ehsanali/Tan.pdf
Also see "Mining of Massive Datasets", usually available at this link, though it seems to be down: http://infolab.stanford.edu/~ullman/mmds/book.pdf
Which leads me to another point: many of these books cost $100+. If you don't have those kinds of resources, try Library Genesis. It's been very helpful for getting started.