Hadoop
Machine Learning
Information Retrieval
- Jimmy Lin and friends
- Jimmy Lin and Michael Schatz. Design Patterns for Efficient Graph Algorithms in MapReduce. Proceedings of the 2010 Workshop on Mining and Learning with Graphs Workshop (MLG-2010).
- Tamer Elsayed, Jimmy Lin, and Douglas Oard. Pairwise Document Similarity in Large Collections with MapReduce. Proceedings of the 46th Annual Meeting of the Association for Computational Linguistics (ACL 2008), Companion Volume, pages 265-268, June 2008, Columbus, Ohio.
- Elsayed, T. and Ture, F. and Lin, J. Brute-Force Approaches to Batch Retrieval: Scalable Indexing with MapReduce, or Why Bother?, 2010, Technical Report HCIL-2010-23, University of Maryland, College Park.
- Jimmy Lin. Brute Force and Indexed Approaches to Pairwise Document Similarity Comparisons with MapReduce.
Proceedings of the 32nd Annual International ACM SIGIR Conference on
Research and Development in Information Retrieval (SIGIR 2009), pages
155-162, July 2009
- Lin, J. Scalable language processing algorithms for the masses: A case study in computing word co-occurrence matrices with MapReduce. Proceedings of the Conference on Empirical Methods in Natural Language Processing, 419-428, 2008.
- Kolak, O. and Schilit, B.N. Generating links by mining quotations. Proceedings of the nineteenth ACM conference on Hypertext and hypermedia, 2008.
Crowdsourcing (e.g. Mechanical Turk) + MapReduceData Compression: Big Data, More Capabilities, Faster Learning Additional Topics |
|