Entity Resolution in Big Data

Publication Year:
Repository URL:
https://digitalcommons.wpi.edu/mqp-all/3948; https://digitalcommons.wpi.edu/cgi/viewcontent.cgi?article=4947&context=mqp-all
Pham, Duc Minh; Vu, Thanh Long
Worcester Polytechnic Institute
Computer Science
artifact description
Today, with the rapid development of technology, humanity has entered a new era of information technology. Data is being transferred from paper to digital form second by second, so the demand for data storage is increasing quickly. New techniques were needed to handle Big Data, which is why Hadoop was created. However, conflicting and duplicate data still occur in many cases. In this report, we illustrate a new technique for entity resolution in big data that uses Hadoop's MapReduce framework.