Research Interests and Activities

Roni Khardon
roni@cs.tufts.edu

Research Interests

Khardon's interests include theoretical foundations, efficient algorithms, practical aspects and applications of machine learning, data mining and knowledge representation. More generally (design and analysis of) algorithms and artificial intelligence.
Concrete problems range from learning to classify objects, learning in order to act, data mining of frequent patterns, and more. A particular focus over the last few years has been solving these in contexts where objects and relations give a natural way to describe the problems.

Current Projects

Learning to Act in Relational Markov Decision Processes ( Khardon,   Wang,   Joshi )

Markov decision processes give a mathematical model for agents acting in a dynamic environment. The agent's actions affect the world, it own state, and whether it is "rewarded" or not. However, the results of actions are not deterministic. In relational models, the state of the world is best described by referring to objects and relations among them. This is a very intuitive setting that matches many problems. Our main interest is in developing agents that learn to act in a useful way by utilizing various sources of information: examples from a teacher, exploration by trial and error, utilizing a model of the world etc. In previous work we have developed a theoretical model for learning from examples, developed algorithms and analyzed them, and developed a system L2Act for solving AI planning problem using this framework. Our current work is focused on developing useful knowledge representations for agents' internal representations (e.g. policies) that lead to efficient and robust learning to act. See the following papers (and publications page):
First Order Decision Diagrams for Relational MDPs , Proceedings of IJCAI 2007
Learning Action Strategies for Planning Domains , Artificial Intelligence Vol 113, 1999, pages 125-148.

Kernel Methods: robustness and applications to logic learning ( Khardon,   Wachman, )

Kernel Methods give a generic way to apply certain learning methods (maximum margin, perceptron, nearest neighbors) to domains with any structure. The main tool is a kernel function which computes inner products in some space, which serves as a virtual representation space for examples. This amazing idea (known since 60s) is being intensively investigated in the machine learning community. Our previous work investigated the scope and limitations of using kernel methods for logic learning. This issue turns out to be tricky since there are nice kernel constructions but they do not necessarily lead to successful learning. Our current focus is on two issues: kernel learning with relational data, and identifying learning algorithms which are robust to "noise" - a small amount of wrong annotation on training data. See the following papers (and publications page):
Learning from Interpretations: A Rooted Kernel for Ordered Hypergraphs Proceedings of ICML 2007.
Noise Tolerant Variants of the Perceptron Algorithm , Journal of Machine Learning Research (JMLR) 8(Feb):227--248, 2007.
Efficiency versus Convergence of Boolean Kernels for On-Line Learning Algorithms , Journal of AI Research (JAIR) 24(2005):341-356.

Automatic Classification of Stars in Astronomical Surveys ( Khardon,   Wachman,   Protopapas )

Huge amounts of data are now available from astronomy surveys where telescopes take measurements of the sky over a long period of time. This information is processed by experts as well as computers to provide high-level classifications and measurements of importance for astronomy. Our project aims to provide automatic methods for classifying astronomical measurements of stars using luminosity time-series measurements and other measurable properties of the stars. Such classification systems can save many expert-human-years and allow for faster paced scientific investigation. Our research develops and applies machine learning techniques for this project including combining multiple forms of data (including time-series measurements), work on kernels (related to the previous project since the data is structured), and providing interpretable results in an end to end prediction system. This work is done in collaboration with the Time Series Center at Harvard.

Mining Frequent Patterns (collaborators listed in articles)

The problem of ``mining frequent patterns'' is one of the most widely studied problems in data mining. Here, given a database capturing some activities (e.g. shopping lists in supermarkets) one searches for patterns of activities (e.g. sets of items bought together) which occur with sufficiently high frequency. This idea has been applied in a variety of real world problems. Our previous work developed formal complexity results for itemset mining, by showing an equivalence to some problems of learning by asking questions, and developed a new algorithm which is particularly efficient when patterns sought are large. Current work is focused on mining in relational data. For example given a set of molecules described by graphs (capturing a family of interest e.g. "high solubility", or "effective drug") one would like to find frequent substructures in this family. See the following papers:
On Mining Closed Sets in Multi-Relational Data , Proceedings of IJCAI 2007
Discovering All Most Specific Sentences , ACM Transactions on Database Systems , Vol 28, pages 140-174, June 2003.

Learning Expressions in First Order Logic ( Khardon,   Arias )

We are studying problems where a machine learning system sees examples and produces hypotheses which are represented by logical formulae. This generic framework fits many applications from learning moves for Chess to identifying whether molecules will work well as drugs. Our theoretical work aims to characterize the computational complexity of these problems, i.e. how easy or hard they are to solve. The work includes developing algorithms which are provably correct and efficient, and proving lower bounds that show limitations on the efficiency of any algorithm for such problems. See the following papers:
Learning Closed Horn Expressions, Information and Computation Vol 178, 2002, pages 214-240.
The Subsumption Lattice and Query Learning , Journal of Computer and System Sciences Volume 72, Issue 1, Pages 72-94 (February 2006)

LogAn-H: a system for learning from relational data ( Khardon,   Arias )

We have developed a system for learning from relational data (e.g. from graphs). The system is pretty efficient, gives a non traditional type of algorithm ("bottom up") based on the theoretical results, and is a state of the art Inductive Logic Programming (ILP) system. Current work is focused on solving large scale problems involving classification of molecules. For an introduction see links to the system below as well as:
Learning Horn Expressions with LogAn-H, Journal of Machine Learning Research (JMLR) 8(Mar):549--587, 2007.


Grants:
The following grants supported this work in the UK:
EPSRC grant GR/M21409 Learning Approximation and Reasoning
EPSRC grant GR/N03167 Learning Relational Expressions for Natural Language Applications
Current Grants:
NSF grant IIS-0099446: Learning and Reasoning with Relational Structures


Systems and Software

LogAn-H
LogAn-H is a system for learning function free Horn expressions. It is based on provably correct algorithms for learning with queries. More information can be found in:
L2Act
The L2Act system is a learning system that takes as input a description of an AI planning domain and a set of solved instances from that domain (the examples), and induces a set of rules ordered as a decision list that can be used as a reactive planner in this domain. More information can be found in:

More Information

More information can be found through my publications dealing with issues in computational learning theory, machine learning, knowledge representation and reasoning, AI planning, data mining, and parallel programs.