A recommendation system for scientific water data

April 28, 2021
11:00am-1:00pm ET
Sococo Halligan 102, Zoom
Speaker: Zhaokun Xue
Host: Alva Couch

Abstract

Thesis defense:

Recommendation systems have become a powerful feature for many online systems, including electronic commerce, video or music applications, news websites, etc. Most current recommendation systems are designed for increasing user satisfaction during use of media services. Recommendation systems for scientific data are targeted at data reuse and scientific progress. A recommendation system for scientific water data must handle scientific users’ multiple-interests behavior and objects with really large and complex sets of attributes. The context of this work is to establish a combination of the content-based filtering algorithm for hydrologic data and a Latent Dirichlet Allocation (LDA) topic modeling of user behavior on the HydroShare platform for water data sharing. The process for making scientific water data recommendations includes inferring users’ multiple-interests from the datasets with which they interact, and then calculating similarity between users’ interests set and datasets’ keywords. Evaluating such a system requires comparing objects’ keywords that are recommended with the datasets that users have already shown interest in using. Compared recommendations made by this approach with other well-known recommendation algorithms, we conclude that the combination of the content-based filtering and the LDA algorithm provides a workable solution for HydroShare.

Join the meeting in Sococo, VH 102, or Zoom.

Join Zoom Meeting: https://tufts.zoom.us/j/98610939077

PASSWORD: See colloquia email

Dial by your location: +1 646 558 8656 US (New York)

Meeting ID: 986 1093 9077

PASSCODE: See colloquia email