Skip to content

udacity/cd0377-Introduction-to-Natural-Language-Processing

Repository files navigation

Introduction to Natural Language Processing

Coding exercises for the Introduction to Natural Language Processing course.

Setup

You need Python 3.6+, and the packages mentioned in requirements.txt. You can install them using:

pip install -r requirements.txt

Data

Data files for exercises are included under data/, but some of the NLP libraries require additional data for performing tasks like PoS tagging, lemmatization, etc. Specifically, nltk will throw an error if the required data is not installed. You can use the following Python statement to open the NLTK downloader and select the desired package(s) to install:

nltk.download()

You can also download all available NLTK data packages, which includes a number of sample corpora as well, but that may take a while (10+GB).

Run

To run any script file, use:

python <script.py>

To open a notebook, use:

jupyter notebook <notebook.ipynb>

Creative Commons License
This work is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License. Please refer to Udacity Terms of Service for further information.

About

Code for cd0377- Introduction to Natural Language Processing taught by Luis Serrano and Arpan Chakraborty

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Contributors 4

  •  
  •  
  •  
  •