This project aims to develop a live search engine for Wikipedia pages, using Hadoop YARN MapReduce and Cassandra.
Four technologies are helping us here:
Hadoop YARN MapReduce:
Hadoop sifts through the 3.5 GB of Wikipedia page abstracts