Data Cleaning in Natural Language Provessing
M4A•Pagina episodului
Manage episode 311353555 series 3111581
Content provided by Sarvesh Bhatnagar. All podcast content including episodes, graphics, and podcast descriptions are uploaded and provided directly by Sarvesh Bhatnagar or their podcast platform partner. If you believe someone is using your copyrighted work without your permission, you can follow the process outlined here https://ro.player.fm/legal.
In this episode we talk about various steps in data cleaning process in Natural Language Processing. Data cleaning is almost a given whenever you want to perform natural language processing onto the given text. Data cleaning in natural language processing involves tokenization, lowering the words, lemmatization, and so on. Aside from talking about that we also talk about how you can implement those briefly. To install codesnip mentioned in the last part open your terminal and write pip install codesnip --- Send in a voice message: https://podcasters.spotify.com/pod/show/sarvesh-bhatnagar/message
…
continue reading
22 episoade