Automatic Tags Projects

Data Set of 750k Articles

As part of my plan to build an open-source NLP tool for WordPress blogs, I have collected 750,000 articles from 100,000 WordPress blogs. I’m making the data available here for others to explore. Download The data is formatted as a gzipped CSV. The file size is 919 MB. Sample A sample view of the data […]