Mark James Carman
News!
5 Jan, 2008:
I have started
blogging(?!) about interesting research articles I read, as well as my musings on research in Artificial Intelligence / Information Retrieval / Databases / etc. I will also post any presentations I give or papers I write. You can check it out
here. Feel free to agree/disagree with me in the comments!
29 Nov, 2007:
I gave a quick talk today (along with
Giovanni Toffetti and
Monica Landoni) to the new PhD students at the University of Lugano on
life during and after a PhD. Finally, my chance to tell the new students all the stuff I wish somebody had told me when I started! I'm posting
my advice/musings online in case somebody else finds it useful.
22 Nov, 2007:
Today I gave a
lecture on Web 2.0 to 3rd year undergraduate students in Informatics at the University of Lugano.
I concentrated on the themes of user-generated content (wikis, blogging, tagging), applications (mash-ups), social networks and personalization. For each area I gave an overview of current techniques and described some examples, before discussing some research areas (and providing pointers to further information).
The slides of the presentation are
available online. Since the presentation was on Web 2.0, I thought that using an online office suite was appropriate...
11 Sep, 2007:
Craig Knoblock and I published an article "
Learning Semantic Definitions of Online Information Sources" in the Journal of Artificial Intelligence Research (
JAIR). The article provides a more detailed description of our work on inducing service descriptions that we presented at IJCAI.
6 Aug, 2007:
I have just taken up a PostDoc position in the
Informatics Faculty of the
University of Lugano, Switzerland. I will be working with
Fabio Crestani on the problem of discovering, modeling and providing personalized access to news feeds, blogs, and other online data sources. I am very excited about the work and see a great opportunity for combining
Data Integration, Personalization and Distributed
Information Retrieval techniques.
14 Feb, 2007:
The
software I wrote for my thesis for
learning definitions of web sources has just been made available on the ISI website! The
package is royalty-free for research purposes and comes with all the source code.
Documentation is "in progress", so feel free to contact me with installation questions.
9 Jan, 2007:
Here are the
slides that I presented today at
IJCAI-07.
About me
I am a PhD Graduate in Computer Science from the
University of Trento, Italy.
Until recently, I was working/studying at the
Information Sciences Institute of the
University of Southern California in the
Information Integration Research Group under the guidance of
Craig Knoblock.
I have now moved to Switzerland and taken up a PostDoc position in the Informatics Faculty of the University of Lugano. I am working with Fabio Crestani on the problem of discovering, modeling and providing personalized access to news feeds, blogs, and other online data sources. To tackle the problem I plan to combine techniques from Data Integration, Personalization and Distributed Information Retrieval.
My research interests are varied and include the following areas:
- Machine Learning & Web/Data Mining
Learning relational models for semi-structured data sources (such as e-commerce websites, weather feeds, hotel search services, etc.)
- Information Integration, Information Retrieval & Web Services
Query reformulation algorithms and schema matching/mapping techniques for integrating data from heterogeneous sources and automatically generating mash-ups.
- Automated Planning & Search
Constraint Satisfaction (CSP) & Propositional Satisfiability (SAT) encodings of Web-agent reasoning (e.g. Web Service orchestration).
- Resource Optimisation for Computational Grids
Development of distributed marketplace algorithms for optimizing data access and availability on a DataGrid.
My research
Recently, I have been working on a system for learning semantic descriptions of online information sources. The aim of the work is to allow for the automated
discovery and integration of new sources into existing integration systems (such as information mediators or simple
mashups). The algorithm we developed induces conjunctive query definitions for information-providing services (such as
these RSS feeds from Yahoo) in terms of other possibly simpler information sources (such as
these). A good overview of what our system can do is found in this paper:
You can
download the software we have developed from the ISI website. The
software is royalty-free for research purposes and comes with all the source code.
Documentation is limited, so feel free to contact me with installation questions.
A detailed description of how the system works can be found in this article:
You can also have a look at the
slides I presented at my thesis defense, or watch the
video of a seminar I gave at ISI last September. (Unfortunately the first 4 minutes of audio are missing, but the video shows the slides as well, so you shouldn't have any problems following the presentation.)
Publications
Here are some of my publications:
Software
EIDOS: Efficiently Inducing Definitions for Online Sources [Download Page]
EIDOS is a system for learning semantic descriptions of online information sources (such as
these RSS feeds). The descriptions are used to
automatically integrate the sources into (mediator-based) information integration systems. A complete description of the purpose and functionality of the system can be found in my
thesis.
You can
download the software from the ISI website. It is royalty-free for research purposes and comes with all the source code. Here is the latest
documentation. Feel free to contact me with installation questions.
Other stuff
People
Over the last few years I've been lucky enough to work with a number of talented people including:
Marco Aiello,
José Luis Ambite,
Paolo Busetta,
Yao-Yi Chiang,
Dan Goldberg,
Craig Knoblock,
Kristina Lerman,
Matt Michelson,
Martin Michalowski,
Mike Papazoglou,
Luciano Serafini,
Kurt Stockinger,
Snehal Thakkar,
Paolo Traverso,
Rattapoom (Pipe) Tuchinda,
Floriano Zini
Blogs
I have a
research blog, where I occasionally post my papers, presentations and thoughts.
Here are some personal blogs of different computer science researchers that I read:
For research into social networks, web mining, etc.:
If you are interested in mashups and geospatial data integration, I recommend:
And to find out what's going on at the Information Sciences Institute of USC, check out the
ai-grads blog.
I received a Bachelor's Degree (with First Class Honours) in Electrical and Electronic Engineering and Arts from the
University of Adelaide in 1999.