NLPurify
[](https://nlpurify.readthedocs.io/en/latest/?badge=latest)
[](https://github.com/sharkutilities/NLPurify/issues)
[](https://github.com/sharkutilities/NLPurify/network)
[](https://github.com/sharkutilities/NLPurify/stargazers)
[](https://github.com/sharkutilities/NLPurify/blob/master/LICENSE)
[](https://pypistats.org/packages/nlpurify)
[](https://pypi.org/project/NLPurify/)
```{toctree}
:maxdepth: 2
:hidden:
:caption: Getting Started
./about/objective.md
./about/installation.md
```
```{toctree}
:maxdepth: 2
:hidden:
:caption: Preprocessing Methods
./preprocessing/normalization.md
./preprocessing/utils.md
```
A text cleaning and extraction engine was developed using a combination of traditional techniques like Unicode translations,
cleaning using regular expressions, and modern tools like "natural language processing" and "large language models" to
detect and clean long texts and create word vectors.