I’m looking for a small text corpus from 1 to 5 pages. With small number of words.
I want to test an algorithm for calculating word similarity which is suitable for small number of examples.
Every sentence can be different meaning i.e. the whole text does not have common context, every sentence can be on its own as long as it is a valid sentence and the total number of words is not too big.
So the criteria is 20 to 500 sentences using as much as possible small number of similar words.