Recently, a friend whose company is working on a large-scale project reached out to us for a solution to a simple problem: finding a list of phrases (approximately 80,000) in a huge set of rich-text documents (approximately 6 million). At first, the problem looked simple. The way engineers had solved it is ... The post "Phrase matching using Apache Spark" appeared first on CloudxLab Blog.