I am passionate about working with data at extreme scales, and my ultimate research goal is to understand what information is and how to quantify meaning from massive, messy data. My research has been focused on large-scale information retrieval (IR) systems and data-intensive processes in distributed settings. I have worked on various projects related to decentralized search, distributed computing for text processing, machine learning, complex networks/systems, and information theoretic modeling.
Download CV in PDFPhD in Information Science, 2010
UNC Chapel Hill
Masters in Information Science, 2006
Indiana University Bloomington
Bachelors in Chemical Engineering, 1998
East China University of Science and Technology
Curiosity & Hands-on Experiments for Science and Engineering
Machine learning, deep reinforcement learning, training and fine tuning with DLITE loss.
The definition and measurement of information is fundamental to methods for information retrieval, text mining, and machine learning.
Data aggregated and retrieval augmented LLMs for reference chat analysis and tool development for library resource and service optimization.
Decentralized search and retrieval on the web scale. Efficiency, effectiveness, and scalability.
Some publications i have recently published
Industry to Academia