Just relevant parts. ES says their max size is 100 mb. We have a real life scenario where we want to index millions of office documents to find PII/PHI
What is the realistic expectation here. Should we say 50 mb. How everybody else do?
Not sure about ES, but Solr removed it's max field limit in release 4.0. Text documents tend to be a lot smaller than people expect, both in terms of word count and file size. I think you will be fine with 50 mb if you are using ES.
What is the realistic expectation here. Should we say 50 mb. How everybody else do?