Welcome to EduNLP’s Tutorials and Documentation¶
EduNLP is a library for advanced Natural Language Processing in Python and is one of the projects of EduX plan of BDAA. It’s built on the very latest research, and was designed from day one to be used in real educational products.
EduNLP now comes with pretrained pipelines and currently supports segment, tokenization and vertorization. It supports varies of preprocessing for NLP in educational scenario, such as formula parsing, multi-modal segment.
EduNLP is commercial open-source software, released under the Apache-2.0 license.
Install¶
EduNLP requires Python version 3.6, 3.7, 3.8 or 3.9. EduNLP use PyTorch as the backend tensor library.
We recommend installing EduNLP by pip:
pip install EduNLP
But you can also install from source:
git clone https://github.com/bigdata-ustc/EduNLP.git
cd EduNLP
pip install .
Getting Started¶
For absolute beginners, start with the Tutorial to EduNLP (中文版). It covers the basic concepts of EduNLP and a step-by-step on training, loading and using the language models.
Contribution¶
EduNLP is free software; you can redistribute it and/or modify it under the terms of the Apache License 2.0. We welcome contributions. Join us on GitHub and check out our contribution guidelines (中文版).