Skip to content

Latest commit

 

History

History
236 lines (235 loc) · 70.3 KB

File metadata and controls

236 lines (235 loc) · 70.3 KB

KDD2017 Paper List

论文 作者 组织 摘要 翻译 代码 引用数
metapath2vec: Scalable Representation Learning for Heterogeneous Networks Yuxiao Dong, Nitesh V. Chawla, Ananthram Swami code 1042
Anomaly Detection with Robust Deep Autoencoders Chong Zhou, Randy C. Paffenroth code 542
struc2vec: Learning Node Representations from Structural Identity Leonardo Filipe Rodrigues Ribeiro, Pedro H. P. Saverese, Daniel R. Figueiredo code 471
Algorithmic Decision Making and the Cost of Fairness Sam CorbettDavies, Emma Pierson, Avi Feller, Sharad Goel, Aziz Huq code 349
Meta-Graph Based Recommendation Fusion over Heterogeneous Information Networks Huan Zhao, Quanming Yao, Jianda Li, Yangqiu Song, Dik Lun Lee code 289
GRAM: Graph-based Attention Model for Healthcare Representation Learning Edward Choi, Mohammad Taha Bahadori, Le Song, Walter F. Stewart, Jimeng Sun code 271
Dipole: Diagnosis Prediction in Healthcare via Attention-based Bidirectional Recurrent Neural Networks Fenglong Ma, Radha Chitta, Jing Zhou, Quanzeng You, Tong Sun, Jing Gao code 256
Google Vizier: A Service for Black-Box Optimization Daniel Golovin, Benjamin Solnik, Subhodeep Moitra, Greg Kochanski, John Karro, D. Sculley code 239
Patient Subtyping via Time-Aware LSTM Networks Inci M. Baytas, Cao Xiao, Xi Zhang, Fei Wang, Anil K. Jain, Jiayu Zhou code 236
Local Higher-Order Graph Clustering Hao Yin, Austin R. Benson, Jure Leskovec, David F. Gleich code 236
Embedding-based News Recommendation for Millions of Users Shumpei Okura, Yukihiro Tagami, Shingo Ono, Akira Tajima code 211
Collaborative Variational Autoencoder for Recommender Systems Xiaopeng Li, James She code 209
Bridging Collaborative Filtering and Semi-Supervised Learning: A Neural Approach for POI Recommendation Carl Yang, Lanxiao Bai, Chao Zhang, Quan Yuan, Jiawei Han code 182
The Simpler The Better: A Unified Approach to Predicting Original Taxi Demands based on Large-Scale Online Platforms Yongxin Tong, Yuqiang Chen, Zimu Zhou, Lei Chen, Jie Wang, Qiang Yang, Jieping Ye, Weifeng Lv code 175
Stock Price Prediction via Discovering Multi-Frequency Trading Patterns Liheng Zhang, Charu C. Aggarwal, GuoJun Qi code 143
TFX: A TensorFlow-Based Production-Scale Machine Learning Platform Denis Baylor, Eric Breck, HengTze Cheng, Noah Fiedel, Chuan Yu Foo, Zakaria Haque, Salem Haykal, Mustafa Ispir, Vihan Jain, Levent Koc, Chiu Yuen Koo, Lukasz Lew, Clemens Mewald, Akshay Naresh Modi, Neoklis Polyzotis, Sukriti Ramesh, Sudip Roy, Steven Euijong Whang, Martin Wicke, Jarek Wilkiewicz, Xin Zhang, Martin Zinkevich code 142
Anomaly Detection in Streams with Extreme Value Theory Alban Siffer, PierreAlain Fouque, Alexandre Termier, Christine Largouët code 139
HinDroid: An Intelligent Android Malware Detection System Based on Structured Heterogeneous Information Network Shifu Hou, Yanfang Ye, Yangqiu Song, Melih Abdulhayoglu code 135
A Taxi Order Dispatch Model based On Combinatorial Optimization Lingyu Zhang, Tao Hu, Yue Min, Guobin Wu, Junying Zhang, Pengcheng Feng, Pinghua Gong, Jieping Ye code 128
Machine Learning for Encrypted Malware Traffic Classification: Accounting for Noisy Labels and Non-Stationarity Blake Anderson, David A. McGrew code 123
Toward Automated Fact-Checking: Detecting Check-worthy Factual Claims by ClaimBuster Naeemul Hassan, Fatma Arslan, Chengkai Li, Mark Tremayne code 118
Planning Bike Lanes based on Sharing-Bikes' Trajectories Jie Bao, Tianfu He, Sijie Ruan, Yanhua Li, Yu Zheng code 116
Learning from Multiple Teacher Networks Shan You, Chang Xu, Chao Xu, Dacheng Tao code 108
Aspect Based Recommendations: Recommending Items with the Most Valuable Aspects Based on User Reviews Konstantin Bauman, Bing Liu, Alexander Tuzhilin code 96
Toeplitz Inverse Covariance-Based Clustering of Multivariate Time Series Data David Hallac, Sagar Vare, Stephen P. Boyd, Jure Leskovec code 93
Weisfeiler-Lehman Neural Machine for Link Prediction Muhan Zhang, Yixin Chen code 88
Dynamic Attention Deep Model for Article Recommendation by Learning Human Editors' Demonstration Xuejian Wang, Lantao Yu, Kan Ren, Guanyu Tao, Weinan Zhang, Yong Yu, Jun Wang code 83
TrioVecEvent: Embedding-Based Online Local Event Detection in Geo-Tagged Tweet Streams Chao Zhang, Liyuan Liu, Dongming Lei, Quan Yuan, Honglei Zhuang, Tim Hanratty, Jiawei Han code 80
DeepMood: Modeling Mobile Phone Typing Dynamics for Mood Detection Bokai Cao, Lei Zheng, Chenwei Zhang, Philip S. Yu, Andrea Piscitello, John Zulueta, Olu Ajilore, Kelly Ryan, Alex D. Leow code 80
DeepSD: Generating High Resolution Climate Change Projections through Single Image Super-Resolution Thomas Vandal, Evan Kodra, Sangram Ganguly, Andrew R. Michaelis, Ramakrishna R. Nemani, Auroop R. Ganguly code 79
ReasoNet: Learning to Stop Reading in Machine Comprehension Yelong Shen, PoSen Huang, Jianfeng Gao, Weizhu Chen code 78
Using Convolutional Networks and Satellite Imagery to Identify Patterns in Urban Environments at a Large Scale Adrian Albert, Jasleen Kaur, Marta C. González code 77
Network Inference via the Time-Varying Graphical Lasso David Hallac, Youngsuk Park, Stephen P. Boyd, Jure Leskovec code 73
A Dirty Dozen: Twelve Common Metric Interpretation Pitfalls in Online Controlled Experiments Pavel A. Dmitriev, Somit Gupta, Dong Woo Kim, Garnet Jason Vaz code 70
Interpretable Predictions of Tree-based Ensembles via Actionable Feature Tweaking Gabriele Tolomei, Fabrizio Silvestri, Andrew Haines, Mounia Lalmas code 69
Adversary Resistant Deep Neural Networks with an Application to Malware Detection Qinglong Wang, Wenbo Guo, Kaixuan Zhang, Alexander G. Ororbia II, Xinyu Xing, Xue Liu, C. Lee Giles code 69
On Sampling Strategies for Neural Network-based Collaborative Filtering Ting Chen, Yizhou Sun, Yue Shi, Liangjie Hong code 67
AnnexML: Approximate Nearest Neighbor Search for Extreme Multi-label Classification Yukihiro Tagami code 61
LEAP: Learning to Prescribe Effective and Safe Treatment Combinations for Multimorbidity Yutao Zhang, Robert Chen, Jie Tang, Walter F. Stewart, Jimeng Sun code 58
Optimized Cost per Click in Taobao Display Advertising Han Zhu, Junqi Jin, Chang Tan, Fei Pan, Yifan Zeng, Han Li, Kun Gai code 52
Not All Passes Are Created Equal: Objectively Measuring the Risk and Reward of Passes in Soccer from Tracking Data Paul Power, Héctor Ruiz, Xinyu Wei, Patrick Lucey code 52
Scalable and Sustainable Deep Learning via Randomized Hashing Ryan Spring, Anshumali Shrivastava code 50
FORA: Simple and Effective Approximate Single-Source Personalized PageRank Sibo Wang, Renchi Yang, Xiaokui Xiao, Zhewei Wei, Yin Yang code 49
Visual Search at eBay Fan Yang, Ajinkya Kale, Yury Bubnov, Leon Stein, Qiaosong Wang, M. Hadi Kiapour, Robinson Piramuthu code 48
Point-of-Interest Demand Modeling with Human Mobility Patterns Yanchi Liu, Chuanren Liu, Xinjiang Lu, Mingfei Teng, Hengshu Zhu, Hui Xiong code 48
Cascade Ranking for Operational E-commerce Search Shichen Liu, Fei Xiao, Wenwu Ou, Luo Si code 45
The Selective Labels Problem: Evaluating Algorithmic Predictions in the Presence of Unobservables Himabindu Lakkaraju, Jon M. Kleinberg, Jure Leskovec, Jens Ludwig, Sendhil Mullainathan code 45
Extremely Fast Decision Tree Mining for Evolving Data Streams Albert Bifet, Jiajin Zhang, Wei Fan, Cheng He, Jianfeng Zhang, Jianfeng Qian, Geoff Holmes, Bernhard Pfahringer code 44
Federated Tensor Factorization for Computational Phenotyping Yejin Kim, Jimeng Sun, Hwanjo Yu, Xiaoqian Jiang code 44
When is a Network a Network?: Multi-Order Graphical Model Selection in Pathways and Temporal Networks Ingo Scholtes code 44
A Location-Sentiment-Aware Recommender System for Both Home-Town and Out-of-Town Users Hao Wang, Yanmei Fu, Qinyong Wang, Hongzhi Yin, Changying Du, Hui Xiong code 42
A Hybrid Framework for Text Modeling with Convolutional RNN Chenglong Wang, Feijun Jiang, Hongxia Yang code 42
PPDsparse: A Parallel Primal-Dual Sparse Method for Extreme Classification Ian EnHsu Yen, Xiangru Huang, Wei Dai, Pradeep Ravikumar, Inderjit S. Dhillon, Eric P. Xing code 41
Functional Zone Based Hierarchical Demand Prediction For Bike System Expansion Junming Liu, Leilei Sun, Qiao Li, Jingci Ming, Yanchi Liu, Hui Xiong code 41
Discrete Content-aware Matrix Factorization Defu Lian, Rui Liu, Yong Ge, Kai Zheng, Xing Xie, Longbing Cao code 39
A Local Algorithm for Structure-Preserving Graph Cut Dawei Zhou, Si Zhang, Mehmet Yigit Yildirim, Scott Alcorn, Hanghang Tong, Hasan Davulcu, Jingrui He code 39
Inductive Semi-supervised Multi-Label Learning with Co-Training Wang Zhan, MinLing Zhang code 38
Ego-Splitting Framework: from Non-Overlapping to Overlapping Clusters Alessandro Epasto, Silvio Lattanzi, Renato Paes Leme code 37
MetaPAD: Meta Pattern Discovery from Massive Text Corpora Meng Jiang, Jingbo Shang, Taylor Cassidy, Xiang Ren, Lance M. Kaplan, Timothy P. Hanratty, Jiawei Han code 37
Prospecting the Career Development of Talents: A Survival Analysis Perspective Huayu Li, Yong Ge, Hengshu Zhu, Hui Xiong, Hongke Zhao code 37
Backpage and Bitcoin: Uncovering Human Traffickers Rebecca S. Portnoff, Danny Yuxing Huang, Periwinkle Doerfler, Sadia Afroz, Damon McCoy code 37
Human Mobility Synchronization and Trip Purpose Detection with Mixture of Hawkes Processes Pengfei Wang, Yanjie Fu, Guannan Liu, Wenqing Hu, Charu C. Aggarwal code 36
Customer Lifetime Value Prediction Using Embeddings Benjamin Paul Chamberlain, Ângelo Cardoso, C. H. Bryan Liu, Roberto Pagliari, Marc Peter Deisenroth code 35
An Efficient Bandit Algorithm for Realtime Multivariate Optimization Daniel N. Hill, Houssam Nassif, Yi Liu, Anand Iyer, S. V. N. Vishwanathan code 35
KATE: K-Competitive Autoencoder for Text Yu Chen, Mohammed J. Zaki code 35
A Century of Science: Globalization of Scientific Collaborations, Citations, and Innovations Yuxiao Dong, Hao Ma, Zhihong Shen, Kuansan Wang code 35
Effective and Real-time In-App Activity Analysis in Encrypted Internet Traffic Streams Junming Liu, Yanjie Fu, Jingci Ming, Yong Ren, Leilei Sun, Hui Xiong code 34
Graph Edge Partitioning via Neighborhood Heuristic Chenzi Zhang, Fan Wei, Qin Liu, Zhihao Gavin Tang, Zhenguo Li code 34
Peeking at A/B Tests: Why it matters, and what to do about it Ramesh Johari, Pete Koomen, Leonid Pekelis, David Walsh code 33
A Hierarchical Algorithm for Extreme Clustering Ari Kobren, Nicholas Monath, Akshay Krishnamurthy, Andrew McCallum code 33
A Context-aware Attention Network for Interactive Question Answering Huayu Li, Martin Renqiang Min, Yong Ge, Asim Kadav code 33
An Alternative to NCD for Large Sequences, Lempel-Ziv Jaccard Distance Edward Raff, Charles K. Nicholas code 32
Robust Spectral Clustering for Noisy Data: Modeling Sparse Corruptions Improves Latent Embeddings Aleksandar Bojchevski, Yves Matkovic, Stephan Günnemann code 31
Privacy-Preserving Distributed Multi-Task Learning with Asynchronous Updates Liyang Xie, Inci M. Baytas, Kaixiang Lin, Jiayu Zhou code 31
Multi-Aspect Streaming Tensor Completion Qingquan Song, Xiao Huang, Hancheng Ge, James Caverlee, Xia Hu code 31
Learning Certifiably Optimal Rule Lists Elaine Angelino, Nicholas LarusStone, Daniel Alabi, Margo I. Seltzer, Cynthia Rudin code 31
Structural Deep Brain Network Mining Shen Wang, Lifang He, Bokai Cao, ChunTa Lu, Philip S. Yu, Ann B. Ragin code 31
FIRST: Fast Interactive Attributed Subgraph Matching Boxin Du, Si Zhang, Nan Cao, Hanghang Tong code 30
Compass: Spatio Temporal Sentiment Analysis of US Election What Twitter Says! Debjyoti Paul, Feifei Li, Murali Krishna Teja, Xin Yu, Richie Frost code 30
Unsupervised Discovery of Drug Side-Effects from Heterogeneous Data Sources Fenglong Ma, Chuishi Meng, Houping Xiao, Qi Li, Jing Gao, Lu Su, Aidong Zhang code 30
Matrix Profile V: A Generic Technique to Incorporate Domain Knowledge into Motif Discovery Hoang Anh Dau, Eamonn J. Keogh code 29
DenseAlert: Incremental Dense-Subtensor Detection in Tensor Streams Kijung Shin, Bryan Hooi, Jisu Kim, Christos Faloutsos code 29
Fast Enumeration of Large k-Plexes Alessio Conte, Donatella Firmani, Caterina Mordente, Maurizio Patrignani, Riccardo Torlone code 29
SPARTan: Scalable PARAFAC2 for Large & Sparse Data Ioakeim Perros, Evangelos E. Papalexakis, Fei Wang, Richard W. Vuduc, Elizabeth Searles, Michael Thompson, Jimeng Sun code 29
Collaboratively Improving Topic Discovery and Word Embeddings by Coordinating Global and Local Contexts Guangxu Xun, Yaliang Li, Jing Gao, Aidong Zhang code 27
EmbedJoin: Efficient Edit Similarity Joins via Embeddings Haoyu Zhang, Qin Zhang code 27
MOLIERE: Automatic Biomedical Hypothesis Generation System Justin Sybrandt, Michael Shtutman, Ilya Safro code 27
KunPeng: Parameter Server based Distributed Learning Systems and Its Applications in Alibaba and Ant Financial Jun Zhou, Xiaolong Li, Peilin Zhao, Chaochao Chen, Longfei Li, Xinxing Yang, Qing Cui, Jin Yu, Xu Chen, Yi Ding, Yuan (Alan) Qi code 26
Post Processing Recommender Systems for Diversity Arda Antikacioglu, R. Ravi code 25
Prognosis and Diagnosis of Parkinson's Disease Using Multi-Task Learning Saba Emrani, Anya McGuirk, Wei Xiao code 25
Estimating Treatment Effect in the Wild via Differentiated Confounder Balancing Kun Kuang, Peng Cui, Bo Li, Meng Jiang, Shiqiang Yang code 25
Unsupervised Feature Selection in Signed Social Networks Kewei Cheng, Jundong Li, Huan Liu code 25
Detecting Network Effects: Randomizing Over Randomized Experiments Martin Saveski, Jean PougetAbadie, Guillaume SaintJacques, Weitao Duan, Souvik Ghosh, Ya Xu, Edoardo M. Airoldi code 25
Contextual Spatial Outlier Detection with Metric Learning Guanjie Zheng, Susan L. Brantley, Thomas Lauvaux, Zhenhui Li code 25
Optimized Risk Scores Berk Ustun, Cynthia Rudin code 24
FLAP: An End-to-End Event Log Analysis Platform for System Management Tao Li, Yexi Jiang, Chunqiu Zeng, Bin Xia, Zheng Liu, Wubai Zhou, Xiaolong Zhu, Wentao Wang, Liang Zhang, Jun Wu, Li Xue, Dewei Bao code 24
Developing a Comprehensive Framework for Multimodal Feature Extraction Quinten McNamara, Alejandro de la Vega, Tal Yarkoni code 24
Efficient Correlated Topic Modeling with Topic Embedding Junxian He, Zhiting Hu, Taylor BergKirkpatrick, Ying Huang, Eric P. Xing code 23
Discovering Reliable Approximate Functional Dependencies Panagiotis Mandros, Mario Boley, Jilles Vreeken code 23
Distributed Multi-Task Relationship Learning Sulin Liu, Sinno Jialin Pan, Qirong Ho code 23
Deep Embedding Forest: Forest-based Serving with Deep Embedding Features Jie Zhu, Ying Shan, J. C. Mao, Dong Yu, Holakou Rahmanian, Yi Zhang code 23
Effective Evaluation Using Logged Bandit Feedback from Multiple Loggers Aman Agarwal, Soumya Basu, Tobias Schnabel, Thorsten Joachims code 23
Accelerating Innovation Through Analogy Mining Tom Hope, Joel Chan, Aniket Kittur, Dafna Shahaf code 23
Incremental Dual-memory LSTM in Land Cover Prediction Xiaowei Jia, Ankush Khandelwal, Guruprasad Nayak, James Gerber, Kimberly Carlson, Paul C. West, Vipin Kumar code 23
Automatic Synonym Discovery with Knowledge Bases Meng Qu, Xiang Ren, Jiawei Han code 23
Achieving Non-Discrimination in Data Release Lu Zhang, Yongkai Wu, Xintao Wu code 22
Deep Choice Model Using Pointer Networks for Airline Itinerary Prediction Alejandro Mottini, Rodrigo AcunaAgost code 22
Automated Categorization of Onion Sites for Analyzing the Darkweb Ecosystem Shalini Ghosh, Ariyam Das, Phillip A. Porras, Vinod Yegneswaran, Ashish Gehani code 22
PAMAE: Parallel k-Medoids Clustering with High Accuracy and Efficiency Hwanjun Song, JaeGil Lee, WookShin Han code 21
DeepProbe: Information Directed Sequence Understanding and Chatbot Design via Recurrent Neural Networks Zi Yin, Kenghao Chang, Ruofei Zhang code 21
Linearized GMM Kernels and Normalized Random Fourier Features Ping Li code 20
STAR: A System for Ticket Analysis and Resolution Wubai Zhou, Wei Xue, Ramesh Baral, Qing Wang, Chunqiu Zeng, Tao Li, Jian Xu, Zheng Liu, Larisa Shwartz, Genady Ya. Grabarnik code 20
Resolving the Bias in Electronic Medical Records Kaiping Zheng, Jinyang Gao, Kee Yuan Ngiam, Beng Chin Ooi, James Wei Luen Yip code 19
Clustering Individual Transactional Data for Masses of Users Riccardo Guidotti, Anna Monreale, Mirco Nanni, Fosca Giannotti, Dino Pedreschi code 19
No Longer Sleeping with a Bomb: A Duet System for Protecting Urban Safety from Dangerous Goods Jingyuan Wang, Chao Chen, Junjie Wu, Zhang Xiong code 19
Large Scale Sentiment Learning with Limited Labels Vasileios Iosifidis, Eirini Ntoutsi code 19
Predicting Clinical Outcomes Across Changing Electronic Health Record Systems Jen J. Gong, Tristan Naumann, Peter Szolovits, John V. Guttag code 18
Multi-Modality Disease Modeling via Collective Deep Matrix Factorization Qi Wang, Mengying Sun, Liang Zhan, Paul Thompson, Shuiwang Ji, Jiayu Zhou code 17
LiJAR: A System for Job Application Redistribution towards Efficient Career Marketplace Fedor Borisyuk, Liang Zhang, Krishnaram Kenthapadi code 17
Discovering Pollution Sources and Propagation Patterns in Urban Area Xiucheng Li, Yun Cheng, Gao Cong, Lisi Chen code 17
PReP: Path-Based Relevance from a Probabilistic Perspective in Heterogeneous Information Networks Yu Shi, PoWei Chan, Honglei Zhuang, Huan Gui, Jiawei Han code 16
On Finding Socially Tenuous Groups for Online Social Networks ChihYa Shen, LiangHao Huang, DeNian Yang, HongHan Shuai, WangChien Lee, MingSyan Chen code 16
Formative Essay Feedback Using Predictive Scoring Models Bronwyn Woods, David Adamson, Shayne Miel, Elijah Mayfield code 16
Quick Access: Building a Smart Experience for Google Drive Sandeep Tata, Alexandrin Popescul, Marc Najork, Mike Colagrosso, Julian Gibbons, Alan Green, Alexandre Mah, Michael Smith, Divanshu Garg, Cayden Meyer, Reuben Kan code 16
Revisiting Power-law Distributions in Spectra of Real World Networks Nicole Eikmeier, David F. Gleich code 15
Recurrent Poisson Factorization for Temporal Recommendation Seyyed Abbas Hosseini, Keivan Alizadeh, Ali Khodadadi, Ali Arabzadeh, Mehrdad Farajtabar, Hongyuan Zha, Hamid R. Rabiee code 15
Optimization Beyond Prediction: Prescriptive Price Optimization Shinji Ito, Ryohei Fujimaki code 15
A Minimal Variance Estimator for the Cardinality of Big Data Set Intersection Reuven Cohen, Liran Katzir, Aviv Yehezkel code 14
Unsupervised Network Discovery for Brain Imaging Data Zilong Bai, Peter B. Walker, Anna E. Tschiffely, Fei Wang, Ian Davidson code 14
Semi-Supervised Techniques for Mining Learning Outcomes and Prerequisites Igor Labutov, Yun Huang, Peter Brusilovsky, Daqing He code 14
Convex Factorization Machine for Toxicogenomics Prediction Makoto Yamada, Wenzhao Lian, Amit Goyal, Jianhui Chen, Kishan Wimalawarne, Suleiman A. Khan, Samuel Kaski, Hiroshi Mamitsuka, Yi Chang code 14
Distributed Local Outlier Detection in Big Data Yizhou Yan, Lei Cao, Caitlin Kuhlman, Elke A. Rundensteiner code 14
Similarity Forests Saket Sathe, Charu C. Aggarwal code 13
Scalable Top-n Local Outlier Detection Yizhou Yan, Lei Cao, Elke A. Rundensteiner code 13
Long Short Memory Process: Modeling Growth Dynamics of Microscopic Social Connectivity Chengxi Zang, Peng Cui, Christos Faloutsos, Wenwu Zhu code 13
TensorFlow Estimators: Managing Simplicity vs. Flexibility in High-Level Machine Learning Frameworks HengTze Cheng, Zakaria Haque, Lichan Hong, Mustafa Ispir, Clemens Mewald, Illia Polosukhin, Georgios Roumpos, D. Sculley, Jamie Smith, David Soergel, Yuan Tang, Philipp Tucker, Martin Wicke, Cassandra Xia, Jianwei Xie code 12
Towards an Optimal Subspace for K-Means Dominik Mautz, Wei Ye, Claudia Plant, Christian Böhm code 12
Statistical Emerging Pattern Mining with Multiple Testing Correction Junpei Komiyama, Masakazu Ishihata, Hiroki Arimura, Takashi Nishibayashi, Shinichi Minato code 12
"The Leicester City Fairytale?": Utilizing New Soccer Analytics Tools to Compare Performance in the 15/16 & 16/17 EPL Seasons Héctor Ruiz, Paul Power, Xinyu Wei, Patrick Lucey code 12
The Fake vs Real Goods Problem: Microscopy and Machine Learning to the Rescue Ashlesh Sharma, Vidyuth Srinivasan, Vishal Kanchan, Lakshminarayanan Subramanian code 11
Tracking the Dynamics in Crowdfunding Hongke Zhao, Hefu Zhang, Yong Ge, Qi Liu, Enhong Chen, Huayu Li, Le Wu code 11
Structural Diversity and Homophily: A Study Across More Than One Hundred Big Networks Yuxiao Dong, Reid A. Johnson, Jian Xu, Nitesh V. Chawla code 11
Structural Event Detection from Log Messages Fei Wu, Pranay Anchuri, Zhenhui Li code 11
A Data Mining Framework for Valuing Large Portfolios of Variable Annuities Guojun Gan, Jimmy Xiangji Huang code 11
MARAS: Signaling Multi-Drug Adverse Reactions Xiao Qin, Tabassum Kakar, Susmitha Wunnava, Elke A. Rundensteiner, Lei Cao code 11
Finding Precursors to Anomalous Drop in Airspeed During a Flight's Takeoff Vijay Manikandan Janakiraman, Bryan L. Matthews, Nikunj C. Oza code 11
A Data-driven Process Recommender Framework Sen Yang, Xin Dong, Leilei Sun, Yichen Zhou, Richard A. Farneth, Hui Xiong, Randall S. Burd, Ivan Marsic code 10
Learning Temporal State of Diabetes Patients via Combining Behavioral and Demographic Data Houping Xiao, Jing Gao, Long H. Vu, Deepak S. Turaga code 10
Functional Annotation of Human Protein Coding Isoforms via Non-convex Multi-Instance Learning Tingjin Luo, Weizhong Zhang, Shang Qiu, Yang Yang, Dongyun Yi, Guangtao Wang, Jieping Ye, Jie Wang code 10
Inferring the Strength of Social Ties: A Community-Driven Approach Polina Rozenshtein, Nikolaj Tatti, Aristides Gionis code 10
Luck is Hard to Beat: The Difficulty of Sports Prediction Raquel Y. S. Aoki, Renato Martins Assunção, Pedro O. S. Vaz de Melo code 10
A Data Science Approach to Understanding Residential Water Contamination in Flint Alex Chojnacki, Chengyu Dai, Arya Farahi, Guangsha Shi, Jared Webb, Daniel T. Zhang, Jacob D. Abernethy, Eric M. Schwartz code 10
Anarchists, Unite: Practical Entropy Approximation for Distributed Streams Moshe Gabel, Daniel Keren, Assaf Schuster code 9
Bolt: Accelerated Data Mining with Fast Vector Compression Davis W. Blalock, John V. Guttag code 9
End-to-end Learning for Short Text Expansion Jian Tang, Yue Wang, Kai Zheng, Qiaozhu Mei code 9
RUSH!: Targeted Time-limited Coupons via Purchase Forecasts Emaad A. Manzoor, Leman Akoglu code 9
Retrospective Higher-Order Markov Processes for User Trails Tao Wu, David F. Gleich code 8
Deep Design: Product Aesthetics for Heterogeneous Markets Yanxin Pan, Alexander Burnap, Jeffrey Hartley, Richard Gonzalez, Panos Y. Papalambros code 8
Robust Top-k Multiclass SVM for Visual Category Recognition Xiaojun Chang, Yaoliang Yu, Yi Yang code 8
The Co-Evolution Model for Social Network Evolving and Opinion Migration Yupeng Gu, Yizhou Sun, Jianxi Gao code 8
BDT: Gradient Boosted Decision Tables for High Accuracy and Scoring Efficiency Yin Lou, Mikhail Obukhov code 8
Big Data in Climate: Opportunities and Challenges for Machine Learning Anuj Karpatne, Vipin Kumar code 7
A Temporally Heterogeneous Survival Framework with Application to Social Behavior Dynamics Linyun Yu, Peng Cui, Chaoming Song, Tianyang Zhang, Shiqiang Yang code 7
Collecting and Analyzing Millions of mHealth Data Streams Tom Quisel, Luca Foschini, Alessio Signorini, David C. Kale code 7
Constructivism Learning: A Learning Paradigm for Transparent Predictive Analytics Xiaoli Li, Jun Huan code 7
Relay-Linking Models for Prominence and Obsolescence in Evolving Networks Mayank Singh, Rajdeep Sarkar, Pawan Goyal, Animesh Mukherjee, Soumen Chakrabarti code 7
Decomposed Normalized Maximum Likelihood Codelength Criterion for Selecting Hierarchical Latent Variable Models Tianyi Wu, Shinya Sugawara, Kenji Yamanishi code 7
Supporting Employer Name Normalization at both Entity and Cluster Level Qiaoling Liu, Faizan Javed, Vachik S. Dave, Ankita Joshi code 7
Contextual Motifs: Increasing the Utility of Motifs using Contextual Data Ian Fox, Lynn Ang, Mamta Jaiswal, Rodica PopBusui, Jenna Wiens code 6
Groups-Keeping Solution Path Algorithm for Sparse Regression with Automatic Feature Grouping Bin Gu, Guodong Liu, Heng Huang code 6
Fast Newton Hard Thresholding Pursuit for Sparsity Constrained Nonconvex Optimization Jinghui Chen, Quanquan Gu code 6
Learning from Labeled and Unlabeled Vertices in Networks Wei Ye, Linfei Zhou, Dominik Mautz, Claudia Plant, Christian Böhm code 6
Real-Time Optimization of Web Publisher RTB Revenues Pedro Chahuara, Nicolas Grislain, Grégoire Jauvion, JeanMichel Renders code 6
AESOP: Automatic Policy Learning for Predicting and Mitigating Network Service Impairments Supratim Deb, Zihui Ge, Sastry Isukapalli, Sarat C. Puthenpura, Shobha Venkataraman, He Yan, Jennifer Yates code 6
Large-scale Collaborative Ranking in Near-Linear Time Liwei Wu, ChoJui Hsieh, James Sharpnack code 5
Online Ranking with Constraints: A Primal-Dual Algorithm and Applications to Web Traffic-Shaping Parikshit Shah, Akshay Soni, Troy Chevalier code 5
A Practical Exploration System for Search Advertising Parikshit Shah, Ming Yang, Sachidanand Alle, Adwait Ratnaparkhi, Ben Shahshahani, Rohit Chandra code 5
Matching Restaurant Menus to Crowdsourced Food Data: A Scalable Machine Learning Approach Hesam Salehian, Patrick D. Howell, Chul Lee code 5
Discovering Enterprise Concepts Using Spreadsheet Tables Keqian Li, Yeye He, Kris Ganjam code 5
A Quasi-experimental Estimate of the Impact of P2P Transportation Platforms on Urban Consumer Patterns Zhe Zhang, Beibei Li code 5
Tripoles: A New Class of Relationships in Time Series Data Saurabh Agrawal, Gowtham Atluri, Anuj Karpatne, William Haltom, Stefan Liess, Snigdhansu Chatterjee, Vipin Kumar code 5
Sparse Compositional Local Metric Learning Joseph St. Amand, Jun Huan code 5
Small Batch or Large Batch?: Gaussian Walk with Rebound Can Teach Peifeng Yin, Ping Luo, Taiga Nakamura code 5
Visualizing Attributed Graphs via Terrain Metaphor Yang Zhang, Yusu Wang, Srinivasan Parthasarathy code 5
Ad Serving with Multiple KPIs Brendan Kitts, Michael Krishnan, Ishadutta Yadav, Yongbo Zeng, Garrett Badeau, Andrew Potter, Sergey Tolkachov, Ethan Thornburg, Satyanarayana Reddy Janga code 5
Multi-view Learning over Retinal Thickness and Visual Sensitivity on Glaucomatous Eyes Toshimitsu Uesaka, Kai Morino, Hiroki Sugiura, Taichi Kiwaki, Hiroshi Murata, Ryo Asaoka, Kenji Yamanishi code 5
Predicting Optimal Facility Location without Customer Locations Emre Yilmaz, Sanem Elbasi, Hakan Ferhatosmanoglu code 5
REMIX: Automated Exploration for Interactive Outlier Detection Yanjie Fu, Charu C. Aggarwal, Srinivasan Parthasarathy, Deepak S. Turaga, Hui Xiong code 4
An Intelligent Customer Care Assistant System for Large-Scale Cellular Network Diagnosis Lujia Pan, Jianfeng Zhang, Patrick P. C. Lee, Hong Cheng, Cheng He, Caifeng He, Keli Zhang code 4
Dispatch with Confidence: Integration of Machine Learning, Optimization and Simulation for Open Pit Mines Kosta Ristovski, Chetan Gupta, Kunihiko Harada, HsiuKhuern Tang code 4
The Future of Data Integration Renée J. Miller code 4
What's Fair? Cynthia Dwork code 4
Designing AI at Scale to Power Everyday Life Rajesh Parekh code 4
The Future of Artificially Intelligent Assistants Muthu Muthukrishnan, Andrew Tomkins, Larry P. Heck, Alborz Geramifard, Deepak Agarwal code 4
Randomized Feature Engineering as a Fast and Accurate Alternative to Kernel Methods Suhang Wang, Charu C. Aggarwal, Huan Liu code 4
PNP: Fast Path Ensemble Method for Movie Design Danai Koutra, Abhilash Dighe, Smriti Bhagat, Udi Weinsberg, Stratis Ioannidis, Christos Faloutsos, Jean Bolot code 4
Internet Device Graphs Matthew Malloy, Paul Barford, Enis Ceyhun Alp, Jonathan Koller, Adria Jewell code 4
Improved Degree Bounds and Full Spectrum Power Laws in Preferential Attachment Networks Chen Avin, Zvi Lotker, Yinon Nahum, David Peleg code 3
HoORaYs: High-order Optimization of Rating Distance for Recommender Systems Jingwei Xu, Yuan Yao, Hanghang Tong, Xianping Tao, Jian Lu code 3
Behavior Informatics to Discover Behavior Insight for Active and Tailored Client Management Longbing Cao code 3
HyperLogLog Hyperextended: Sketches for Concave Sublinear Frequency Statistics Edith Cohen code 3
Coresets for Kernel Regression Yan Zheng, Jeff M. Phillips code 3
Let's See Your Digits: Anomalous-State Detection using Benford's Law Samuel Maurus, Claudia Plant code 3
Learning to Count Mosquitoes for the Sterile Insect Technique Yaniv Ovadia, Yoni Halpern, Dilip Krishnan, Josh Livni, Daniel Newburger, Ryan Poplin, Tiantian Zha, D. Sculley code 3
Learning to Generate Rock Descriptions from Multivariate Well Logs with Hierarchical Attention Bin Tong, Martin Klinkigt, Makoto Iwayama, Toshihiko Yanase, Yoshiyuki Kobayashi, Anshuman Sahu, Ravigopal Vennelakanti code 3
Local Algorithm for User Action Prediction Towards Display Ads Hongxia Yang, Yada Zhu, Jingrui He code 2
Unsupervised P2P Rental Recommendations via Integer Programming Yanjie Fu, Guannan Liu, Mingfei Teng, Charu C. Aggarwal code 2
Multi-task Function-on-function Regression with Co-grouping Structured Sparsity Pei Yang, Qi Tan, Jingrui He code 2
Communication-Efficient Distributed Block Minimization for Nonlinear Kernel Machines ChoJui Hsieh, Si Si, Inderjit S. Dhillon code 2
Estimation of Recent Ancestral Origins of Individuals on a Large Scale Ross E. Curtis, Ahna Reza Girshick code 2
Pharmacovigilance via Baseline Regularization with Large-Scale Longitudinal Observational Data Zhaobin Kuang, Peggy L. Peissig, Vítor Santos Costa, Richard Maclin, David Page code 2
Learning Tree-Structured Detection Cascades for Heterogeneous Networks of Embedded Devices Hamid Dadkhahi, Benjamin M. Marlin code 2
Three Principles of Data Science: Predictability, Stability and Computability Bin Yu code 2
Benchmarks and Process Management in Data Science: Will We Ever Get Over the Mess? Usama M. Fayyad, Arno Candel, Eduardo Ariño de la Rubia, Szilárd Pafka, Anthony Chong, JeongYoon Lee code 2
Is the Whole Greater Than the Sum of Its Parts? Liangyue Li, Hanghang Tong, Yong Wang, Conglei Shi, Nan Cao, Norbou Buchler code 2
Construction of Directed 2K Graphs Bálint Tillman, Athina Markopoulou, Carter T. Butts, Minas Gjoka code 2
Automatic Application Identification from Billions of Files Kyle Soska, Christopher S. Gates, Kevin A. Roundy, Nicolas Christin code 2
Randomization or Condensation?: Linear-Cost Matrix Sketching Via Cascaded Compression Sampling Kai Zhang, Chuanren Liu, Jie Zhang, Hui Xiong, Eric P. Xing, Jieping Ye code 1
Spaceborne Data Enters the Mainstream David Potere code 1
Evaluating U.S. Electoral Representation with a Joint Statistical Model of Congressional Roll-Calls, Legislative Text, and Voter Registration Data Zhengming Xing, Sunshine Hillygus, Lawrence Carin code 1
GELL: Automatic Extraction of Epidemiological Line Lists from Open Sources Saurav Ghosh, Prithwish Chakraborty, Bryan L. Lewis, Maimuna S. Majumder, Emily Cohn, John S. Brownstein, Madhav V. Marathe, Naren Ramakrishnan code 1
A Practical Algorithm for Solving the Incoherence Problem of Topic Models In Industrial Applications Amr Ahmed, James Long, Daniel Silva, Yuan Wang code 1
Industrial Machine Learning Josh Bloom code 0
Machine Learning Software in Practice: Quo Vadis? Szilárd Pafka code 0
SPOT: Sparse Optimal Transformations for High Dimensional Variable Selection and Exploratory Regression Analysis Qiming Huang, Michael Zhu code 0
Foreword to the Applied Data Science: Invited Talks Track at KDD-2017 Usama M. Fayyad, Evangelos Simoudis, Ashok Srivastava code 0
More than the Sum of its Parts: Building Domino Data Lab Eduardo Ariño de la Rubia code 0
Mining Big Data in NeuroGenetics to Understand Muscular Dystrophy Andy Berglund code 0
Planning and Learning under Uncertainty: Theory and Practice Jonathan P. How code 0
It Takes More than Math and Engineering to Hit the Bullseye with Data Paritosh Desai code 0
Addressing Challenges with Big Data for Media Measurement Mainak Mazumdar code 0
Mixture Factorized Ornstein-Uhlenbeck Processes for Time-Series Forecasting GuoJun Qi, Jiliang Tang, Jingdong Wang, Jiebo Luo code 0