metapath2vec: Scalable Representation Learning for Heterogeneous Networks |
Yuxiao Dong, Nitesh V. Chawla, Ananthram Swami |
|
|
|
code |
1042 |
Anomaly Detection with Robust Deep Autoencoders |
Chong Zhou, Randy C. Paffenroth |
|
|
|
code |
542 |
struc2vec: Learning Node Representations from Structural Identity |
Leonardo Filipe Rodrigues Ribeiro, Pedro H. P. Saverese, Daniel R. Figueiredo |
|
|
|
code |
471 |
Algorithmic Decision Making and the Cost of Fairness |
Sam CorbettDavies, Emma Pierson, Avi Feller, Sharad Goel, Aziz Huq |
|
|
|
code |
349 |
Meta-Graph Based Recommendation Fusion over Heterogeneous Information Networks |
Huan Zhao, Quanming Yao, Jianda Li, Yangqiu Song, Dik Lun Lee |
|
|
|
code |
289 |
GRAM: Graph-based Attention Model for Healthcare Representation Learning |
Edward Choi, Mohammad Taha Bahadori, Le Song, Walter F. Stewart, Jimeng Sun |
|
|
|
code |
271 |
Dipole: Diagnosis Prediction in Healthcare via Attention-based Bidirectional Recurrent Neural Networks |
Fenglong Ma, Radha Chitta, Jing Zhou, Quanzeng You, Tong Sun, Jing Gao |
|
|
|
code |
256 |
Google Vizier: A Service for Black-Box Optimization |
Daniel Golovin, Benjamin Solnik, Subhodeep Moitra, Greg Kochanski, John Karro, D. Sculley |
|
|
|
code |
239 |
Patient Subtyping via Time-Aware LSTM Networks |
Inci M. Baytas, Cao Xiao, Xi Zhang, Fei Wang, Anil K. Jain, Jiayu Zhou |
|
|
|
code |
236 |
Local Higher-Order Graph Clustering |
Hao Yin, Austin R. Benson, Jure Leskovec, David F. Gleich |
|
|
|
code |
236 |
Embedding-based News Recommendation for Millions of Users |
Shumpei Okura, Yukihiro Tagami, Shingo Ono, Akira Tajima |
|
|
|
code |
211 |
Collaborative Variational Autoencoder for Recommender Systems |
Xiaopeng Li, James She |
|
|
|
code |
209 |
Bridging Collaborative Filtering and Semi-Supervised Learning: A Neural Approach for POI Recommendation |
Carl Yang, Lanxiao Bai, Chao Zhang, Quan Yuan, Jiawei Han |
|
|
|
code |
182 |
The Simpler The Better: A Unified Approach to Predicting Original Taxi Demands based on Large-Scale Online Platforms |
Yongxin Tong, Yuqiang Chen, Zimu Zhou, Lei Chen, Jie Wang, Qiang Yang, Jieping Ye, Weifeng Lv |
|
|
|
code |
175 |
Stock Price Prediction via Discovering Multi-Frequency Trading Patterns |
Liheng Zhang, Charu C. Aggarwal, GuoJun Qi |
|
|
|
code |
143 |
TFX: A TensorFlow-Based Production-Scale Machine Learning Platform |
Denis Baylor, Eric Breck, HengTze Cheng, Noah Fiedel, Chuan Yu Foo, Zakaria Haque, Salem Haykal, Mustafa Ispir, Vihan Jain, Levent Koc, Chiu Yuen Koo, Lukasz Lew, Clemens Mewald, Akshay Naresh Modi, Neoklis Polyzotis, Sukriti Ramesh, Sudip Roy, Steven Euijong Whang, Martin Wicke, Jarek Wilkiewicz, Xin Zhang, Martin Zinkevich |
|
|
|
code |
142 |
Anomaly Detection in Streams with Extreme Value Theory |
Alban Siffer, PierreAlain Fouque, Alexandre Termier, Christine Largouët |
|
|
|
code |
139 |
HinDroid: An Intelligent Android Malware Detection System Based on Structured Heterogeneous Information Network |
Shifu Hou, Yanfang Ye, Yangqiu Song, Melih Abdulhayoglu |
|
|
|
code |
135 |
A Taxi Order Dispatch Model based On Combinatorial Optimization |
Lingyu Zhang, Tao Hu, Yue Min, Guobin Wu, Junying Zhang, Pengcheng Feng, Pinghua Gong, Jieping Ye |
|
|
|
code |
128 |
Machine Learning for Encrypted Malware Traffic Classification: Accounting for Noisy Labels and Non-Stationarity |
Blake Anderson, David A. McGrew |
|
|
|
code |
123 |
Toward Automated Fact-Checking: Detecting Check-worthy Factual Claims by ClaimBuster |
Naeemul Hassan, Fatma Arslan, Chengkai Li, Mark Tremayne |
|
|
|
code |
118 |
Planning Bike Lanes based on Sharing-Bikes' Trajectories |
Jie Bao, Tianfu He, Sijie Ruan, Yanhua Li, Yu Zheng |
|
|
|
code |
116 |
Learning from Multiple Teacher Networks |
Shan You, Chang Xu, Chao Xu, Dacheng Tao |
|
|
|
code |
108 |
Aspect Based Recommendations: Recommending Items with the Most Valuable Aspects Based on User Reviews |
Konstantin Bauman, Bing Liu, Alexander Tuzhilin |
|
|
|
code |
96 |
Toeplitz Inverse Covariance-Based Clustering of Multivariate Time Series Data |
David Hallac, Sagar Vare, Stephen P. Boyd, Jure Leskovec |
|
|
|
code |
93 |
Weisfeiler-Lehman Neural Machine for Link Prediction |
Muhan Zhang, Yixin Chen |
|
|
|
code |
88 |
Dynamic Attention Deep Model for Article Recommendation by Learning Human Editors' Demonstration |
Xuejian Wang, Lantao Yu, Kan Ren, Guanyu Tao, Weinan Zhang, Yong Yu, Jun Wang |
|
|
|
code |
83 |
TrioVecEvent: Embedding-Based Online Local Event Detection in Geo-Tagged Tweet Streams |
Chao Zhang, Liyuan Liu, Dongming Lei, Quan Yuan, Honglei Zhuang, Tim Hanratty, Jiawei Han |
|
|
|
code |
80 |
DeepMood: Modeling Mobile Phone Typing Dynamics for Mood Detection |
Bokai Cao, Lei Zheng, Chenwei Zhang, Philip S. Yu, Andrea Piscitello, John Zulueta, Olu Ajilore, Kelly Ryan, Alex D. Leow |
|
|
|
code |
80 |
DeepSD: Generating High Resolution Climate Change Projections through Single Image Super-Resolution |
Thomas Vandal, Evan Kodra, Sangram Ganguly, Andrew R. Michaelis, Ramakrishna R. Nemani, Auroop R. Ganguly |
|
|
|
code |
79 |
ReasoNet: Learning to Stop Reading in Machine Comprehension |
Yelong Shen, PoSen Huang, Jianfeng Gao, Weizhu Chen |
|
|
|
code |
78 |
Using Convolutional Networks and Satellite Imagery to Identify Patterns in Urban Environments at a Large Scale |
Adrian Albert, Jasleen Kaur, Marta C. González |
|
|
|
code |
77 |
Network Inference via the Time-Varying Graphical Lasso |
David Hallac, Youngsuk Park, Stephen P. Boyd, Jure Leskovec |
|
|
|
code |
73 |
A Dirty Dozen: Twelve Common Metric Interpretation Pitfalls in Online Controlled Experiments |
Pavel A. Dmitriev, Somit Gupta, Dong Woo Kim, Garnet Jason Vaz |
|
|
|
code |
70 |
Interpretable Predictions of Tree-based Ensembles via Actionable Feature Tweaking |
Gabriele Tolomei, Fabrizio Silvestri, Andrew Haines, Mounia Lalmas |
|
|
|
code |
69 |
Adversary Resistant Deep Neural Networks with an Application to Malware Detection |
Qinglong Wang, Wenbo Guo, Kaixuan Zhang, Alexander G. Ororbia II, Xinyu Xing, Xue Liu, C. Lee Giles |
|
|
|
code |
69 |
On Sampling Strategies for Neural Network-based Collaborative Filtering |
Ting Chen, Yizhou Sun, Yue Shi, Liangjie Hong |
|
|
|
code |
67 |
AnnexML: Approximate Nearest Neighbor Search for Extreme Multi-label Classification |
Yukihiro Tagami |
|
|
|
code |
61 |
LEAP: Learning to Prescribe Effective and Safe Treatment Combinations for Multimorbidity |
Yutao Zhang, Robert Chen, Jie Tang, Walter F. Stewart, Jimeng Sun |
|
|
|
code |
58 |
Optimized Cost per Click in Taobao Display Advertising |
Han Zhu, Junqi Jin, Chang Tan, Fei Pan, Yifan Zeng, Han Li, Kun Gai |
|
|
|
code |
52 |
Not All Passes Are Created Equal: Objectively Measuring the Risk and Reward of Passes in Soccer from Tracking Data |
Paul Power, Héctor Ruiz, Xinyu Wei, Patrick Lucey |
|
|
|
code |
52 |
Scalable and Sustainable Deep Learning via Randomized Hashing |
Ryan Spring, Anshumali Shrivastava |
|
|
|
code |
50 |
FORA: Simple and Effective Approximate Single-Source Personalized PageRank |
Sibo Wang, Renchi Yang, Xiaokui Xiao, Zhewei Wei, Yin Yang |
|
|
|
code |
49 |
Visual Search at eBay |
Fan Yang, Ajinkya Kale, Yury Bubnov, Leon Stein, Qiaosong Wang, M. Hadi Kiapour, Robinson Piramuthu |
|
|
|
code |
48 |
Point-of-Interest Demand Modeling with Human Mobility Patterns |
Yanchi Liu, Chuanren Liu, Xinjiang Lu, Mingfei Teng, Hengshu Zhu, Hui Xiong |
|
|
|
code |
48 |
Cascade Ranking for Operational E-commerce Search |
Shichen Liu, Fei Xiao, Wenwu Ou, Luo Si |
|
|
|
code |
45 |
The Selective Labels Problem: Evaluating Algorithmic Predictions in the Presence of Unobservables |
Himabindu Lakkaraju, Jon M. Kleinberg, Jure Leskovec, Jens Ludwig, Sendhil Mullainathan |
|
|
|
code |
45 |
Extremely Fast Decision Tree Mining for Evolving Data Streams |
Albert Bifet, Jiajin Zhang, Wei Fan, Cheng He, Jianfeng Zhang, Jianfeng Qian, Geoff Holmes, Bernhard Pfahringer |
|
|
|
code |
44 |
Federated Tensor Factorization for Computational Phenotyping |
Yejin Kim, Jimeng Sun, Hwanjo Yu, Xiaoqian Jiang |
|
|
|
code |
44 |
When is a Network a Network?: Multi-Order Graphical Model Selection in Pathways and Temporal Networks |
Ingo Scholtes |
|
|
|
code |
44 |
A Location-Sentiment-Aware Recommender System for Both Home-Town and Out-of-Town Users |
Hao Wang, Yanmei Fu, Qinyong Wang, Hongzhi Yin, Changying Du, Hui Xiong |
|
|
|
code |
42 |
A Hybrid Framework for Text Modeling with Convolutional RNN |
Chenglong Wang, Feijun Jiang, Hongxia Yang |
|
|
|
code |
42 |
PPDsparse: A Parallel Primal-Dual Sparse Method for Extreme Classification |
Ian EnHsu Yen, Xiangru Huang, Wei Dai, Pradeep Ravikumar, Inderjit S. Dhillon, Eric P. Xing |
|
|
|
code |
41 |
Functional Zone Based Hierarchical Demand Prediction For Bike System Expansion |
Junming Liu, Leilei Sun, Qiao Li, Jingci Ming, Yanchi Liu, Hui Xiong |
|
|
|
code |
41 |
Discrete Content-aware Matrix Factorization |
Defu Lian, Rui Liu, Yong Ge, Kai Zheng, Xing Xie, Longbing Cao |
|
|
|
code |
39 |
A Local Algorithm for Structure-Preserving Graph Cut |
Dawei Zhou, Si Zhang, Mehmet Yigit Yildirim, Scott Alcorn, Hanghang Tong, Hasan Davulcu, Jingrui He |
|
|
|
code |
39 |
Inductive Semi-supervised Multi-Label Learning with Co-Training |
Wang Zhan, MinLing Zhang |
|
|
|
code |
38 |
Ego-Splitting Framework: from Non-Overlapping to Overlapping Clusters |
Alessandro Epasto, Silvio Lattanzi, Renato Paes Leme |
|
|
|
code |
37 |
MetaPAD: Meta Pattern Discovery from Massive Text Corpora |
Meng Jiang, Jingbo Shang, Taylor Cassidy, Xiang Ren, Lance M. Kaplan, Timothy P. Hanratty, Jiawei Han |
|
|
|
code |
37 |
Prospecting the Career Development of Talents: A Survival Analysis Perspective |
Huayu Li, Yong Ge, Hengshu Zhu, Hui Xiong, Hongke Zhao |
|
|
|
code |
37 |
Backpage and Bitcoin: Uncovering Human Traffickers |
Rebecca S. Portnoff, Danny Yuxing Huang, Periwinkle Doerfler, Sadia Afroz, Damon McCoy |
|
|
|
code |
37 |
Human Mobility Synchronization and Trip Purpose Detection with Mixture of Hawkes Processes |
Pengfei Wang, Yanjie Fu, Guannan Liu, Wenqing Hu, Charu C. Aggarwal |
|
|
|
code |
36 |
Customer Lifetime Value Prediction Using Embeddings |
Benjamin Paul Chamberlain, Ângelo Cardoso, C. H. Bryan Liu, Roberto Pagliari, Marc Peter Deisenroth |
|
|
|
code |
35 |
An Efficient Bandit Algorithm for Realtime Multivariate Optimization |
Daniel N. Hill, Houssam Nassif, Yi Liu, Anand Iyer, S. V. N. Vishwanathan |
|
|
|
code |
35 |
KATE: K-Competitive Autoencoder for Text |
Yu Chen, Mohammed J. Zaki |
|
|
|
code |
35 |
A Century of Science: Globalization of Scientific Collaborations, Citations, and Innovations |
Yuxiao Dong, Hao Ma, Zhihong Shen, Kuansan Wang |
|
|
|
code |
35 |
Effective and Real-time In-App Activity Analysis in Encrypted Internet Traffic Streams |
Junming Liu, Yanjie Fu, Jingci Ming, Yong Ren, Leilei Sun, Hui Xiong |
|
|
|
code |
34 |
Graph Edge Partitioning via Neighborhood Heuristic |
Chenzi Zhang, Fan Wei, Qin Liu, Zhihao Gavin Tang, Zhenguo Li |
|
|
|
code |
34 |
Peeking at A/B Tests: Why it matters, and what to do about it |
Ramesh Johari, Pete Koomen, Leonid Pekelis, David Walsh |
|
|
|
code |
33 |
A Hierarchical Algorithm for Extreme Clustering |
Ari Kobren, Nicholas Monath, Akshay Krishnamurthy, Andrew McCallum |
|
|
|
code |
33 |
A Context-aware Attention Network for Interactive Question Answering |
Huayu Li, Martin Renqiang Min, Yong Ge, Asim Kadav |
|
|
|
code |
33 |
An Alternative to NCD for Large Sequences, Lempel-Ziv Jaccard Distance |
Edward Raff, Charles K. Nicholas |
|
|
|
code |
32 |
Robust Spectral Clustering for Noisy Data: Modeling Sparse Corruptions Improves Latent Embeddings |
Aleksandar Bojchevski, Yves Matkovic, Stephan Günnemann |
|
|
|
code |
31 |
Privacy-Preserving Distributed Multi-Task Learning with Asynchronous Updates |
Liyang Xie, Inci M. Baytas, Kaixiang Lin, Jiayu Zhou |
|
|
|
code |
31 |
Multi-Aspect Streaming Tensor Completion |
Qingquan Song, Xiao Huang, Hancheng Ge, James Caverlee, Xia Hu |
|
|
|
code |
31 |
Learning Certifiably Optimal Rule Lists |
Elaine Angelino, Nicholas LarusStone, Daniel Alabi, Margo I. Seltzer, Cynthia Rudin |
|
|
|
code |
31 |
Structural Deep Brain Network Mining |
Shen Wang, Lifang He, Bokai Cao, ChunTa Lu, Philip S. Yu, Ann B. Ragin |
|
|
|
code |
31 |
FIRST: Fast Interactive Attributed Subgraph Matching |
Boxin Du, Si Zhang, Nan Cao, Hanghang Tong |
|
|
|
code |
30 |
Compass: Spatio Temporal Sentiment Analysis of US Election What Twitter Says! |
Debjyoti Paul, Feifei Li, Murali Krishna Teja, Xin Yu, Richie Frost |
|
|
|
code |
30 |
Unsupervised Discovery of Drug Side-Effects from Heterogeneous Data Sources |
Fenglong Ma, Chuishi Meng, Houping Xiao, Qi Li, Jing Gao, Lu Su, Aidong Zhang |
|
|
|
code |
30 |
Matrix Profile V: A Generic Technique to Incorporate Domain Knowledge into Motif Discovery |
Hoang Anh Dau, Eamonn J. Keogh |
|
|
|
code |
29 |
DenseAlert: Incremental Dense-Subtensor Detection in Tensor Streams |
Kijung Shin, Bryan Hooi, Jisu Kim, Christos Faloutsos |
|
|
|
code |
29 |
Fast Enumeration of Large k-Plexes |
Alessio Conte, Donatella Firmani, Caterina Mordente, Maurizio Patrignani, Riccardo Torlone |
|
|
|
code |
29 |
SPARTan: Scalable PARAFAC2 for Large & Sparse Data |
Ioakeim Perros, Evangelos E. Papalexakis, Fei Wang, Richard W. Vuduc, Elizabeth Searles, Michael Thompson, Jimeng Sun |
|
|
|
code |
29 |
Collaboratively Improving Topic Discovery and Word Embeddings by Coordinating Global and Local Contexts |
Guangxu Xun, Yaliang Li, Jing Gao, Aidong Zhang |
|
|
|
code |
27 |
EmbedJoin: Efficient Edit Similarity Joins via Embeddings |
Haoyu Zhang, Qin Zhang |
|
|
|
code |
27 |
MOLIERE: Automatic Biomedical Hypothesis Generation System |
Justin Sybrandt, Michael Shtutman, Ilya Safro |
|
|
|
code |
27 |
KunPeng: Parameter Server based Distributed Learning Systems and Its Applications in Alibaba and Ant Financial |
Jun Zhou, Xiaolong Li, Peilin Zhao, Chaochao Chen, Longfei Li, Xinxing Yang, Qing Cui, Jin Yu, Xu Chen, Yi Ding, Yuan (Alan) Qi |
|
|
|
code |
26 |
Post Processing Recommender Systems for Diversity |
Arda Antikacioglu, R. Ravi |
|
|
|
code |
25 |
Prognosis and Diagnosis of Parkinson's Disease Using Multi-Task Learning |
Saba Emrani, Anya McGuirk, Wei Xiao |
|
|
|
code |
25 |
Estimating Treatment Effect in the Wild via Differentiated Confounder Balancing |
Kun Kuang, Peng Cui, Bo Li, Meng Jiang, Shiqiang Yang |
|
|
|
code |
25 |
Unsupervised Feature Selection in Signed Social Networks |
Kewei Cheng, Jundong Li, Huan Liu |
|
|
|
code |
25 |
Detecting Network Effects: Randomizing Over Randomized Experiments |
Martin Saveski, Jean PougetAbadie, Guillaume SaintJacques, Weitao Duan, Souvik Ghosh, Ya Xu, Edoardo M. Airoldi |
|
|
|
code |
25 |
Contextual Spatial Outlier Detection with Metric Learning |
Guanjie Zheng, Susan L. Brantley, Thomas Lauvaux, Zhenhui Li |
|
|
|
code |
25 |
Optimized Risk Scores |
Berk Ustun, Cynthia Rudin |
|
|
|
code |
24 |
FLAP: An End-to-End Event Log Analysis Platform for System Management |
Tao Li, Yexi Jiang, Chunqiu Zeng, Bin Xia, Zheng Liu, Wubai Zhou, Xiaolong Zhu, Wentao Wang, Liang Zhang, Jun Wu, Li Xue, Dewei Bao |
|
|
|
code |
24 |
Developing a Comprehensive Framework for Multimodal Feature Extraction |
Quinten McNamara, Alejandro de la Vega, Tal Yarkoni |
|
|
|
code |
24 |
Efficient Correlated Topic Modeling with Topic Embedding |
Junxian He, Zhiting Hu, Taylor BergKirkpatrick, Ying Huang, Eric P. Xing |
|
|
|
code |
23 |
Discovering Reliable Approximate Functional Dependencies |
Panagiotis Mandros, Mario Boley, Jilles Vreeken |
|
|
|
code |
23 |
Distributed Multi-Task Relationship Learning |
Sulin Liu, Sinno Jialin Pan, Qirong Ho |
|
|
|
code |
23 |
Deep Embedding Forest: Forest-based Serving with Deep Embedding Features |
Jie Zhu, Ying Shan, J. C. Mao, Dong Yu, Holakou Rahmanian, Yi Zhang |
|
|
|
code |
23 |
Effective Evaluation Using Logged Bandit Feedback from Multiple Loggers |
Aman Agarwal, Soumya Basu, Tobias Schnabel, Thorsten Joachims |
|
|
|
code |
23 |
Accelerating Innovation Through Analogy Mining |
Tom Hope, Joel Chan, Aniket Kittur, Dafna Shahaf |
|
|
|
code |
23 |
Incremental Dual-memory LSTM in Land Cover Prediction |
Xiaowei Jia, Ankush Khandelwal, Guruprasad Nayak, James Gerber, Kimberly Carlson, Paul C. West, Vipin Kumar |
|
|
|
code |
23 |
Automatic Synonym Discovery with Knowledge Bases |
Meng Qu, Xiang Ren, Jiawei Han |
|
|
|
code |
23 |
Achieving Non-Discrimination in Data Release |
Lu Zhang, Yongkai Wu, Xintao Wu |
|
|
|
code |
22 |
Deep Choice Model Using Pointer Networks for Airline Itinerary Prediction |
Alejandro Mottini, Rodrigo AcunaAgost |
|
|
|
code |
22 |
Automated Categorization of Onion Sites for Analyzing the Darkweb Ecosystem |
Shalini Ghosh, Ariyam Das, Phillip A. Porras, Vinod Yegneswaran, Ashish Gehani |
|
|
|
code |
22 |
PAMAE: Parallel k-Medoids Clustering with High Accuracy and Efficiency |
Hwanjun Song, JaeGil Lee, WookShin Han |
|
|
|
code |
21 |
DeepProbe: Information Directed Sequence Understanding and Chatbot Design via Recurrent Neural Networks |
Zi Yin, Kenghao Chang, Ruofei Zhang |
|
|
|
code |
21 |
Linearized GMM Kernels and Normalized Random Fourier Features |
Ping Li |
|
|
|
code |
20 |
STAR: A System for Ticket Analysis and Resolution |
Wubai Zhou, Wei Xue, Ramesh Baral, Qing Wang, Chunqiu Zeng, Tao Li, Jian Xu, Zheng Liu, Larisa Shwartz, Genady Ya. Grabarnik |
|
|
|
code |
20 |
Resolving the Bias in Electronic Medical Records |
Kaiping Zheng, Jinyang Gao, Kee Yuan Ngiam, Beng Chin Ooi, James Wei Luen Yip |
|
|
|
code |
19 |
Clustering Individual Transactional Data for Masses of Users |
Riccardo Guidotti, Anna Monreale, Mirco Nanni, Fosca Giannotti, Dino Pedreschi |
|
|
|
code |
19 |
No Longer Sleeping with a Bomb: A Duet System for Protecting Urban Safety from Dangerous Goods |
Jingyuan Wang, Chao Chen, Junjie Wu, Zhang Xiong |
|
|
|
code |
19 |
Large Scale Sentiment Learning with Limited Labels |
Vasileios Iosifidis, Eirini Ntoutsi |
|
|
|
code |
19 |
Predicting Clinical Outcomes Across Changing Electronic Health Record Systems |
Jen J. Gong, Tristan Naumann, Peter Szolovits, John V. Guttag |
|
|
|
code |
18 |
Multi-Modality Disease Modeling via Collective Deep Matrix Factorization |
Qi Wang, Mengying Sun, Liang Zhan, Paul Thompson, Shuiwang Ji, Jiayu Zhou |
|
|
|
code |
17 |
LiJAR: A System for Job Application Redistribution towards Efficient Career Marketplace |
Fedor Borisyuk, Liang Zhang, Krishnaram Kenthapadi |
|
|
|
code |
17 |
Discovering Pollution Sources and Propagation Patterns in Urban Area |
Xiucheng Li, Yun Cheng, Gao Cong, Lisi Chen |
|
|
|
code |
17 |
PReP: Path-Based Relevance from a Probabilistic Perspective in Heterogeneous Information Networks |
Yu Shi, PoWei Chan, Honglei Zhuang, Huan Gui, Jiawei Han |
|
|
|
code |
16 |
On Finding Socially Tenuous Groups for Online Social Networks |
ChihYa Shen, LiangHao Huang, DeNian Yang, HongHan Shuai, WangChien Lee, MingSyan Chen |
|
|
|
code |
16 |
Formative Essay Feedback Using Predictive Scoring Models |
Bronwyn Woods, David Adamson, Shayne Miel, Elijah Mayfield |
|
|
|
code |
16 |
Quick Access: Building a Smart Experience for Google Drive |
Sandeep Tata, Alexandrin Popescul, Marc Najork, Mike Colagrosso, Julian Gibbons, Alan Green, Alexandre Mah, Michael Smith, Divanshu Garg, Cayden Meyer, Reuben Kan |
|
|
|
code |
16 |
Revisiting Power-law Distributions in Spectra of Real World Networks |
Nicole Eikmeier, David F. Gleich |
|
|
|
code |
15 |
Recurrent Poisson Factorization for Temporal Recommendation |
Seyyed Abbas Hosseini, Keivan Alizadeh, Ali Khodadadi, Ali Arabzadeh, Mehrdad Farajtabar, Hongyuan Zha, Hamid R. Rabiee |
|
|
|
code |
15 |
Optimization Beyond Prediction: Prescriptive Price Optimization |
Shinji Ito, Ryohei Fujimaki |
|
|
|
code |
15 |
A Minimal Variance Estimator for the Cardinality of Big Data Set Intersection |
Reuven Cohen, Liran Katzir, Aviv Yehezkel |
|
|
|
code |
14 |
Unsupervised Network Discovery for Brain Imaging Data |
Zilong Bai, Peter B. Walker, Anna E. Tschiffely, Fei Wang, Ian Davidson |
|
|
|
code |
14 |
Semi-Supervised Techniques for Mining Learning Outcomes and Prerequisites |
Igor Labutov, Yun Huang, Peter Brusilovsky, Daqing He |
|
|
|
code |
14 |
Convex Factorization Machine for Toxicogenomics Prediction |
Makoto Yamada, Wenzhao Lian, Amit Goyal, Jianhui Chen, Kishan Wimalawarne, Suleiman A. Khan, Samuel Kaski, Hiroshi Mamitsuka, Yi Chang |
|
|
|
code |
14 |
Distributed Local Outlier Detection in Big Data |
Yizhou Yan, Lei Cao, Caitlin Kuhlman, Elke A. Rundensteiner |
|
|
|
code |
14 |
Similarity Forests |
Saket Sathe, Charu C. Aggarwal |
|
|
|
code |
13 |
Scalable Top-n Local Outlier Detection |
Yizhou Yan, Lei Cao, Elke A. Rundensteiner |
|
|
|
code |
13 |
Long Short Memory Process: Modeling Growth Dynamics of Microscopic Social Connectivity |
Chengxi Zang, Peng Cui, Christos Faloutsos, Wenwu Zhu |
|
|
|
code |
13 |
TensorFlow Estimators: Managing Simplicity vs. Flexibility in High-Level Machine Learning Frameworks |
HengTze Cheng, Zakaria Haque, Lichan Hong, Mustafa Ispir, Clemens Mewald, Illia Polosukhin, Georgios Roumpos, D. Sculley, Jamie Smith, David Soergel, Yuan Tang, Philipp Tucker, Martin Wicke, Cassandra Xia, Jianwei Xie |
|
|
|
code |
12 |
Towards an Optimal Subspace for K-Means |
Dominik Mautz, Wei Ye, Claudia Plant, Christian Böhm |
|
|
|
code |
12 |
Statistical Emerging Pattern Mining with Multiple Testing Correction |
Junpei Komiyama, Masakazu Ishihata, Hiroki Arimura, Takashi Nishibayashi, Shinichi Minato |
|
|
|
code |
12 |
"The Leicester City Fairytale?": Utilizing New Soccer Analytics Tools to Compare Performance in the 15/16 & 16/17 EPL Seasons |
Héctor Ruiz, Paul Power, Xinyu Wei, Patrick Lucey |
|
|
|
code |
12 |
The Fake vs Real Goods Problem: Microscopy and Machine Learning to the Rescue |
Ashlesh Sharma, Vidyuth Srinivasan, Vishal Kanchan, Lakshminarayanan Subramanian |
|
|
|
code |
11 |
Tracking the Dynamics in Crowdfunding |
Hongke Zhao, Hefu Zhang, Yong Ge, Qi Liu, Enhong Chen, Huayu Li, Le Wu |
|
|
|
code |
11 |
Structural Diversity and Homophily: A Study Across More Than One Hundred Big Networks |
Yuxiao Dong, Reid A. Johnson, Jian Xu, Nitesh V. Chawla |
|
|
|
code |
11 |
Structural Event Detection from Log Messages |
Fei Wu, Pranay Anchuri, Zhenhui Li |
|
|
|
code |
11 |
A Data Mining Framework for Valuing Large Portfolios of Variable Annuities |
Guojun Gan, Jimmy Xiangji Huang |
|
|
|
code |
11 |
MARAS: Signaling Multi-Drug Adverse Reactions |
Xiao Qin, Tabassum Kakar, Susmitha Wunnava, Elke A. Rundensteiner, Lei Cao |
|
|
|
code |
11 |
Finding Precursors to Anomalous Drop in Airspeed During a Flight's Takeoff |
Vijay Manikandan Janakiraman, Bryan L. Matthews, Nikunj C. Oza |
|
|
|
code |
11 |
A Data-driven Process Recommender Framework |
Sen Yang, Xin Dong, Leilei Sun, Yichen Zhou, Richard A. Farneth, Hui Xiong, Randall S. Burd, Ivan Marsic |
|
|
|
code |
10 |
Learning Temporal State of Diabetes Patients via Combining Behavioral and Demographic Data |
Houping Xiao, Jing Gao, Long H. Vu, Deepak S. Turaga |
|
|
|
code |
10 |
Functional Annotation of Human Protein Coding Isoforms via Non-convex Multi-Instance Learning |
Tingjin Luo, Weizhong Zhang, Shang Qiu, Yang Yang, Dongyun Yi, Guangtao Wang, Jieping Ye, Jie Wang |
|
|
|
code |
10 |
Inferring the Strength of Social Ties: A Community-Driven Approach |
Polina Rozenshtein, Nikolaj Tatti, Aristides Gionis |
|
|
|
code |
10 |
Luck is Hard to Beat: The Difficulty of Sports Prediction |
Raquel Y. S. Aoki, Renato Martins Assunção, Pedro O. S. Vaz de Melo |
|
|
|
code |
10 |
A Data Science Approach to Understanding Residential Water Contamination in Flint |
Alex Chojnacki, Chengyu Dai, Arya Farahi, Guangsha Shi, Jared Webb, Daniel T. Zhang, Jacob D. Abernethy, Eric M. Schwartz |
|
|
|
code |
10 |
Anarchists, Unite: Practical Entropy Approximation for Distributed Streams |
Moshe Gabel, Daniel Keren, Assaf Schuster |
|
|
|
code |
9 |
Bolt: Accelerated Data Mining with Fast Vector Compression |
Davis W. Blalock, John V. Guttag |
|
|
|
code |
9 |
End-to-end Learning for Short Text Expansion |
Jian Tang, Yue Wang, Kai Zheng, Qiaozhu Mei |
|
|
|
code |
9 |
RUSH!: Targeted Time-limited Coupons via Purchase Forecasts |
Emaad A. Manzoor, Leman Akoglu |
|
|
|
code |
9 |
Retrospective Higher-Order Markov Processes for User Trails |
Tao Wu, David F. Gleich |
|
|
|
code |
8 |
Deep Design: Product Aesthetics for Heterogeneous Markets |
Yanxin Pan, Alexander Burnap, Jeffrey Hartley, Richard Gonzalez, Panos Y. Papalambros |
|
|
|
code |
8 |
Robust Top-k Multiclass SVM for Visual Category Recognition |
Xiaojun Chang, Yaoliang Yu, Yi Yang |
|
|
|
code |
8 |
The Co-Evolution Model for Social Network Evolving and Opinion Migration |
Yupeng Gu, Yizhou Sun, Jianxi Gao |
|
|
|
code |
8 |
BDT: Gradient Boosted Decision Tables for High Accuracy and Scoring Efficiency |
Yin Lou, Mikhail Obukhov |
|
|
|
code |
8 |
Big Data in Climate: Opportunities and Challenges for Machine Learning |
Anuj Karpatne, Vipin Kumar |
|
|
|
code |
7 |
A Temporally Heterogeneous Survival Framework with Application to Social Behavior Dynamics |
Linyun Yu, Peng Cui, Chaoming Song, Tianyang Zhang, Shiqiang Yang |
|
|
|
code |
7 |
Collecting and Analyzing Millions of mHealth Data Streams |
Tom Quisel, Luca Foschini, Alessio Signorini, David C. Kale |
|
|
|
code |
7 |
Constructivism Learning: A Learning Paradigm for Transparent Predictive Analytics |
Xiaoli Li, Jun Huan |
|
|
|
code |
7 |
Relay-Linking Models for Prominence and Obsolescence in Evolving Networks |
Mayank Singh, Rajdeep Sarkar, Pawan Goyal, Animesh Mukherjee, Soumen Chakrabarti |
|
|
|
code |
7 |
Decomposed Normalized Maximum Likelihood Codelength Criterion for Selecting Hierarchical Latent Variable Models |
Tianyi Wu, Shinya Sugawara, Kenji Yamanishi |
|
|
|
code |
7 |
Supporting Employer Name Normalization at both Entity and Cluster Level |
Qiaoling Liu, Faizan Javed, Vachik S. Dave, Ankita Joshi |
|
|
|
code |
7 |
Contextual Motifs: Increasing the Utility of Motifs using Contextual Data |
Ian Fox, Lynn Ang, Mamta Jaiswal, Rodica PopBusui, Jenna Wiens |
|
|
|
code |
6 |
Groups-Keeping Solution Path Algorithm for Sparse Regression with Automatic Feature Grouping |
Bin Gu, Guodong Liu, Heng Huang |
|
|
|
code |
6 |
Fast Newton Hard Thresholding Pursuit for Sparsity Constrained Nonconvex Optimization |
Jinghui Chen, Quanquan Gu |
|
|
|
code |
6 |
Learning from Labeled and Unlabeled Vertices in Networks |
Wei Ye, Linfei Zhou, Dominik Mautz, Claudia Plant, Christian Böhm |
|
|
|
code |
6 |
Real-Time Optimization of Web Publisher RTB Revenues |
Pedro Chahuara, Nicolas Grislain, Grégoire Jauvion, JeanMichel Renders |
|
|
|
code |
6 |
AESOP: Automatic Policy Learning for Predicting and Mitigating Network Service Impairments |
Supratim Deb, Zihui Ge, Sastry Isukapalli, Sarat C. Puthenpura, Shobha Venkataraman, He Yan, Jennifer Yates |
|
|
|
code |
6 |
Large-scale Collaborative Ranking in Near-Linear Time |
Liwei Wu, ChoJui Hsieh, James Sharpnack |
|
|
|
code |
5 |
Online Ranking with Constraints: A Primal-Dual Algorithm and Applications to Web Traffic-Shaping |
Parikshit Shah, Akshay Soni, Troy Chevalier |
|
|
|
code |
5 |
A Practical Exploration System for Search Advertising |
Parikshit Shah, Ming Yang, Sachidanand Alle, Adwait Ratnaparkhi, Ben Shahshahani, Rohit Chandra |
|
|
|
code |
5 |
Matching Restaurant Menus to Crowdsourced Food Data: A Scalable Machine Learning Approach |
Hesam Salehian, Patrick D. Howell, Chul Lee |
|
|
|
code |
5 |
Discovering Enterprise Concepts Using Spreadsheet Tables |
Keqian Li, Yeye He, Kris Ganjam |
|
|
|
code |
5 |
A Quasi-experimental Estimate of the Impact of P2P Transportation Platforms on Urban Consumer Patterns |
Zhe Zhang, Beibei Li |
|
|
|
code |
5 |
Tripoles: A New Class of Relationships in Time Series Data |
Saurabh Agrawal, Gowtham Atluri, Anuj Karpatne, William Haltom, Stefan Liess, Snigdhansu Chatterjee, Vipin Kumar |
|
|
|
code |
5 |
Sparse Compositional Local Metric Learning |
Joseph St. Amand, Jun Huan |
|
|
|
code |
5 |
Small Batch or Large Batch?: Gaussian Walk with Rebound Can Teach |
Peifeng Yin, Ping Luo, Taiga Nakamura |
|
|
|
code |
5 |
Visualizing Attributed Graphs via Terrain Metaphor |
Yang Zhang, Yusu Wang, Srinivasan Parthasarathy |
|
|
|
code |
5 |
Ad Serving with Multiple KPIs |
Brendan Kitts, Michael Krishnan, Ishadutta Yadav, Yongbo Zeng, Garrett Badeau, Andrew Potter, Sergey Tolkachov, Ethan Thornburg, Satyanarayana Reddy Janga |
|
|
|
code |
5 |
Multi-view Learning over Retinal Thickness and Visual Sensitivity on Glaucomatous Eyes |
Toshimitsu Uesaka, Kai Morino, Hiroki Sugiura, Taichi Kiwaki, Hiroshi Murata, Ryo Asaoka, Kenji Yamanishi |
|
|
|
code |
5 |
Predicting Optimal Facility Location without Customer Locations |
Emre Yilmaz, Sanem Elbasi, Hakan Ferhatosmanoglu |
|
|
|
code |
5 |
REMIX: Automated Exploration for Interactive Outlier Detection |
Yanjie Fu, Charu C. Aggarwal, Srinivasan Parthasarathy, Deepak S. Turaga, Hui Xiong |
|
|
|
code |
4 |
An Intelligent Customer Care Assistant System for Large-Scale Cellular Network Diagnosis |
Lujia Pan, Jianfeng Zhang, Patrick P. C. Lee, Hong Cheng, Cheng He, Caifeng He, Keli Zhang |
|
|
|
code |
4 |
Dispatch with Confidence: Integration of Machine Learning, Optimization and Simulation for Open Pit Mines |
Kosta Ristovski, Chetan Gupta, Kunihiko Harada, HsiuKhuern Tang |
|
|
|
code |
4 |
The Future of Data Integration |
Renée J. Miller |
|
|
|
code |
4 |
What's Fair? |
Cynthia Dwork |
|
|
|
code |
4 |
Designing AI at Scale to Power Everyday Life |
Rajesh Parekh |
|
|
|
code |
4 |
The Future of Artificially Intelligent Assistants |
Muthu Muthukrishnan, Andrew Tomkins, Larry P. Heck, Alborz Geramifard, Deepak Agarwal |
|
|
|
code |
4 |
Randomized Feature Engineering as a Fast and Accurate Alternative to Kernel Methods |
Suhang Wang, Charu C. Aggarwal, Huan Liu |
|
|
|
code |
4 |
PNP: Fast Path Ensemble Method for Movie Design |
Danai Koutra, Abhilash Dighe, Smriti Bhagat, Udi Weinsberg, Stratis Ioannidis, Christos Faloutsos, Jean Bolot |
|
|
|
code |
4 |
Internet Device Graphs |
Matthew Malloy, Paul Barford, Enis Ceyhun Alp, Jonathan Koller, Adria Jewell |
|
|
|
code |
4 |
Improved Degree Bounds and Full Spectrum Power Laws in Preferential Attachment Networks |
Chen Avin, Zvi Lotker, Yinon Nahum, David Peleg |
|
|
|
code |
3 |
HoORaYs: High-order Optimization of Rating Distance for Recommender Systems |
Jingwei Xu, Yuan Yao, Hanghang Tong, Xianping Tao, Jian Lu |
|
|
|
code |
3 |
Behavior Informatics to Discover Behavior Insight for Active and Tailored Client Management |
Longbing Cao |
|
|
|
code |
3 |
HyperLogLog Hyperextended: Sketches for Concave Sublinear Frequency Statistics |
Edith Cohen |
|
|
|
code |
3 |
Coresets for Kernel Regression |
Yan Zheng, Jeff M. Phillips |
|
|
|
code |
3 |
Let's See Your Digits: Anomalous-State Detection using Benford's Law |
Samuel Maurus, Claudia Plant |
|
|
|
code |
3 |
Learning to Count Mosquitoes for the Sterile Insect Technique |
Yaniv Ovadia, Yoni Halpern, Dilip Krishnan, Josh Livni, Daniel Newburger, Ryan Poplin, Tiantian Zha, D. Sculley |
|
|
|
code |
3 |
Learning to Generate Rock Descriptions from Multivariate Well Logs with Hierarchical Attention |
Bin Tong, Martin Klinkigt, Makoto Iwayama, Toshihiko Yanase, Yoshiyuki Kobayashi, Anshuman Sahu, Ravigopal Vennelakanti |
|
|
|
code |
3 |
Local Algorithm for User Action Prediction Towards Display Ads |
Hongxia Yang, Yada Zhu, Jingrui He |
|
|
|
code |
2 |
Unsupervised P2P Rental Recommendations via Integer Programming |
Yanjie Fu, Guannan Liu, Mingfei Teng, Charu C. Aggarwal |
|
|
|
code |
2 |
Multi-task Function-on-function Regression with Co-grouping Structured Sparsity |
Pei Yang, Qi Tan, Jingrui He |
|
|
|
code |
2 |
Communication-Efficient Distributed Block Minimization for Nonlinear Kernel Machines |
ChoJui Hsieh, Si Si, Inderjit S. Dhillon |
|
|
|
code |
2 |
Estimation of Recent Ancestral Origins of Individuals on a Large Scale |
Ross E. Curtis, Ahna Reza Girshick |
|
|
|
code |
2 |
Pharmacovigilance via Baseline Regularization with Large-Scale Longitudinal Observational Data |
Zhaobin Kuang, Peggy L. Peissig, Vítor Santos Costa, Richard Maclin, David Page |
|
|
|
code |
2 |
Learning Tree-Structured Detection Cascades for Heterogeneous Networks of Embedded Devices |
Hamid Dadkhahi, Benjamin M. Marlin |
|
|
|
code |
2 |
Three Principles of Data Science: Predictability, Stability and Computability |
Bin Yu |
|
|
|
code |
2 |
Benchmarks and Process Management in Data Science: Will We Ever Get Over the Mess? |
Usama M. Fayyad, Arno Candel, Eduardo Ariño de la Rubia, Szilárd Pafka, Anthony Chong, JeongYoon Lee |
|
|
|
code |
2 |
Is the Whole Greater Than the Sum of Its Parts? |
Liangyue Li, Hanghang Tong, Yong Wang, Conglei Shi, Nan Cao, Norbou Buchler |
|
|
|
code |
2 |
Construction of Directed 2K Graphs |
Bálint Tillman, Athina Markopoulou, Carter T. Butts, Minas Gjoka |
|
|
|
code |
2 |
Automatic Application Identification from Billions of Files |
Kyle Soska, Christopher S. Gates, Kevin A. Roundy, Nicolas Christin |
|
|
|
code |
2 |
Randomization or Condensation?: Linear-Cost Matrix Sketching Via Cascaded Compression Sampling |
Kai Zhang, Chuanren Liu, Jie Zhang, Hui Xiong, Eric P. Xing, Jieping Ye |
|
|
|
code |
1 |
Spaceborne Data Enters the Mainstream |
David Potere |
|
|
|
code |
1 |
Evaluating U.S. Electoral Representation with a Joint Statistical Model of Congressional Roll-Calls, Legislative Text, and Voter Registration Data |
Zhengming Xing, Sunshine Hillygus, Lawrence Carin |
|
|
|
code |
1 |
GELL: Automatic Extraction of Epidemiological Line Lists from Open Sources |
Saurav Ghosh, Prithwish Chakraborty, Bryan L. Lewis, Maimuna S. Majumder, Emily Cohn, John S. Brownstein, Madhav V. Marathe, Naren Ramakrishnan |
|
|
|
code |
1 |
A Practical Algorithm for Solving the Incoherence Problem of Topic Models In Industrial Applications |
Amr Ahmed, James Long, Daniel Silva, Yuan Wang |
|
|
|
code |
1 |
Industrial Machine Learning |
Josh Bloom |
|
|
|
code |
0 |
Machine Learning Software in Practice: Quo Vadis? |
Szilárd Pafka |
|
|
|
code |
0 |
SPOT: Sparse Optimal Transformations for High Dimensional Variable Selection and Exploratory Regression Analysis |
Qiming Huang, Michael Zhu |
|
|
|
code |
0 |
Foreword to the Applied Data Science: Invited Talks Track at KDD-2017 |
Usama M. Fayyad, Evangelos Simoudis, Ashok Srivastava |
|
|
|
code |
0 |
More than the Sum of its Parts: Building Domino Data Lab |
Eduardo Ariño de la Rubia |
|
|
|
code |
0 |
Mining Big Data in NeuroGenetics to Understand Muscular Dystrophy |
Andy Berglund |
|
|
|
code |
0 |
Planning and Learning under Uncertainty: Theory and Practice |
Jonathan P. How |
|
|
|
code |
0 |
It Takes More than Math and Engineering to Hit the Bullseye with Data |
Paritosh Desai |
|
|
|
code |
0 |
Addressing Challenges with Big Data for Media Measurement |
Mainak Mazumdar |
|
|
|
code |
0 |
Mixture Factorized Ornstein-Uhlenbeck Processes for Time-Series Forecasting |
GuoJun Qi, Jiliang Tang, Jingdong Wang, Jiebo Luo |
|
|
|
code |
0 |