In this thesis, we develop a formalism for reasoning about human-powered data management, and use this formalism to design:

The methods presented herein combine the flexibility of statistical models with key ideas and empirical observations from the data mining and social networks communities, and are supported by distributed systems research for cluster computing.

Congratulations to all the outstanding students who were nominated and to this year's winners.

Since distrust is a special type of negative link, I demonstrate the generalization of properties and algorithms of distrust to negative links, i.

Computational models developed with large-scale real-world behavioral data have shown significant progress in identifying these malicious entities.

Furthermore, the final dissertation defense must not have taken place prior to January 1st,

This thesis develops flexible estimation procedures, with provable theoretical guarantees, for uncovering unknown hidden structures underlying the data-generating process.


SIGKDD Awards: SIGKDD Dissertation Award Winners

Travel may or may not be partially covered, depending on the total availability of funds and the number of awards given. Jian Pei wins the ACM SIGKDD Service Award for his significant technical contributions to the principles, practice and application of data mining and for his outstanding service to society and the data mining community.

We received 19 nominations this year, a new record in the history of this award. Pedro Domingos was recognized for his foundational research in data stream analysis, cost-sensitive classification, adversarial learning, and Markov logic networks.


Call for Participation, Papers, Workshops, Tutorials, Nominations

These efforts come together in a novel mixed-membership triangle motif model that scales to large networks with over a million nodes on just a few cluster machines, and can be readily extended to accommodate network context using the other techniques presented herein. It is a great way to stay connected and contribute back. In this dissertation, we choose to focus on short user feedback i.

Statistical analysis of these high-dimensional data sets is possible only if an estimation procedure exploits hidden structures underlying the data. The winner and runners-up will be invited to present their work in a special session at the KDD conference.

This tutorial discusses three broad directions of state-of-the-art data-driven methods to model malicious behavior: