Publications

Selected Publication

Mining Social Networks for Personalized Email Prioritization
Author
Shinjae Yoo, Yiming Yang, Frank Lin, and Il-Chul Moon
Year
2009
Conference Name
ACM SIGKDD Conference on Knowledge Discovery and Data Mining (KDD 2009)
Presentation Date
Jun 28
City
Paris
Country
France

Shinjae Yoo, Yiming Yang, Frank Lin, and Il-Chul Moon, Mining Social Networks for Personalized Email Prioritization, ACM SIGKDD Conference on Knowledge Discovery and Data Mining (KDD 2009), Paris, France, Jun 28, 2009

 

Abstract

Email is one of the most prevalent communication tools today, and solving the email overload problem is urgent. A good way to alleviate email overload is to automatically prioritize received messages according to the priorities of each user. However, research on statistical learning methods for fully personalized email prioritization (PEP) has been sparse due to privacy issues, since people are reluctant to share personal messages and importance judgments with the research community. It is therefore important to develop and evaluate PEP methods under the assumption that only limited training examples are available, and that the system can only have the personal email data of each user during the training and testing of the model for that user. This paper presents the first study (to the best of our knowledge) under such an assumption. Specifically, we focus on analysis of personal social networks to capture user groups and to obtain rich features that represent the social roles from the viewpoint of a particular user. We also developed a novel semi-supervised (transductive) learning algorithm that propagates importance labels from training examples to test examples through message and user nodes in a personal email network. These methods together enable us to obtain an enriched vector representation of each new email message, which consists of both standard features of an email message (such as words in the title or body, sender and receiver IDs, etc.) and the induced social features from the sender and receivers of the message. Using the enriched vector representation as the input in SVM classifiers to predict the importance level for each test message, we obtained significant performance improvement over the baseline system (without induced social features) in our experiments on a multi-user data collection: the relative error reduction in MAE was 31% in micro-averaging and 14% in macro-averaging.
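The abstract reports relative error reduction in MAE under both micro- and macro-averaging. The sketch below illustrates how these two averages can differ; it is not the authors' evaluation code. The user names, the importance levels (assumed here to be integers from 1 to 5), and all predictions are hypothetical toy data, and the function names are illustrative only.

```python
from statistics import mean


def mae(pairs):
    """Mean absolute error over (true_level, predicted_level) pairs."""
    return mean(abs(t - p) for t, p in pairs)


def micro_mae(per_user):
    """Pool every message from every user, then average once."""
    pooled = [pair for pairs in per_user.values() for pair in pairs]
    return mae(pooled)


def macro_mae(per_user):
    """Average within each user first, then average the per-user scores."""
    return mean(mae(pairs) for pairs in per_user.values())


def relative_error_reduction(baseline_error, improved_error):
    """Fraction of the baseline error removed by the improved system."""
    return (baseline_error - improved_error) / baseline_error


# Hypothetical toy data: user -> list of (true level, predicted level).
baseline = {
    "alice": [(5, 3), (4, 2), (1, 3), (2, 4)],  # heavy user, large errors
    "bob": [(3, 2)],                            # light user
}
enriched = {
    "alice": [(5, 4), (4, 3), (1, 2), (2, 3)],  # errors halved for alice
    "bob": [(3, 2)],                            # unchanged for bob
}

micro_rer = relative_error_reduction(micro_mae(baseline), micro_mae(enriched))
macro_rer = relative_error_reduction(macro_mae(baseline), macro_mae(enriched))
print(f"micro-averaged reduction: {micro_rer:.1%}")
print(f"macro-averaged reduction: {macro_rer:.1%}")
```

In this toy setup the improvement is concentrated on the user with the most messages, so the micro-averaged reduction comes out larger than the macro-averaged one. Micro-averaging weights every message equally, while macro-averaging weights every user equally.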


@inproceedings{Yoo:2009:MSN:1557019.1557124,
  author = {Yoo, Shinjae and Yang, Yiming and Lin, Frank and Moon, Il-Chul},
  title = {Mining Social Networks for Personalized Email Prioritization},
  booktitle = {Proceedings of the 15th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining},
  series = {KDD '09},
  year = {2009},
  isbn = {978-1-60558-495-9},
  location = {Paris, France},
  pages = {967--976},
  numpages = {10},
  url = {http://doi.acm.org/10.1145/1557019.1557124},
  doi = {10.1145/1557019.1557124},
  acmid = {1557124},
  publisher = {ACM},
  address = {New York, NY, USA},
  keywords = {email prioritization, social network, text mining}
}


Source Website:  

http://dl.acm.org/citation.cfm?id=1557124&CFID=780919100&CFTOKEN=49500883