
Publications 

A complete list of publications in reverse chronological order. Selected research papers over 10 pages at top-tier venues are marked in red. (See the footnote on authorship.)

2025

Fuzzing MLIR compilers with Custom Mutation Synthesis, Ben Limpanukorn, Jiyuan Wang, Hong Jin Kang, Eric Zitong Zhou, Miryung Kim, 47th International Conference on Software Engineering (ICSE '25) 12 pages (an earlier version at arXiv)

2024

Dagstuhl on Code Search: A Journey through Searching Similar Code, Invited Keynote (pdf)
 
Natural Symbolic Execution-based Testing for Big Data Analytics, Yaoxuan Wu, Ahmad Humayun, Muhammad Ali Gulzar, and Miryung Kim, ACM International Conference on the Foundations of Software Engineering 2024 (pdf) (NaturalSym github) (FSE '24)

Scaling Code Pattern Inference with Interactive What-If Analysis, 46th International Conference on Software Engineering (ICSE '24) 12 pages (pdf) (demo) (SURF github)

Human-in-the-Loop Synthetic Text Data Inspection with Provenance Tracking, Hong Jin Kang*, Fabrice Harel-Canada*, Muhammad Ali Gulzar, Violet Peng, and Miryung Kim, NAACL '24 (pdf) (Provenance Inspector github)

DRust: Language-Guided Distributed Shared Memory with Fine Granularity, Full Transparency, and Ultra Efficiency, Haoran Ma, Yifan Qiao, Shi Liu, Shan Yu, Yuanjiang Ni, Qingda Lu, Jiesheng Wu, Yiying Zhang, Miryung Kim, Harry Xu (pdf) (OSDI '24)

Harvesting Idle Memory for Application-managed Soft State with Midas, Yifan Qiao, Zhenyuan Ruan, Haoran Ma, Adam Belay, Miryung Kim, Harry Xu (pdf) (Midas github) (NSDI '24)

2023

Software Engineering for Data Intensive Scalable Computing and Heterogeneous Computing, ICSE 2023 Future of Software Engineering, Invited paper (16 pages) (pdf)

NaturalFuzz: Natural Input Generation for Big Data Analytics, Ahmad Humayun, Yaoxuan Wu, Miryung Kim, Muhammad Ali Gulzar, 38th IEEE/ACM International Conference on Automated Software Engineering (ASE 2023), 12 pages (pdf)

Co-Dependence Aware Fuzzing for Dataflow-based Big Data Analytics, Ahmad Humayun, Miryung Kim, Muhammad Ali Gulzar, ACM Joint European Software Engineering Conference and Symposium on the Foundations of Software Engineering (ESEC/FSE 2023) 12 pages (pdf)

Leveraging Hardware Probes and Optimizations for Accelerating Fuzz Testing of Heterogeneous Applications, Jiyuan Wang, Qian Zhang, Hongbo Rong, Harry Xu, Miryung Kim, ACM Joint European Software Engineering Conference and Symposium on the Foundations of Software Engineering (ESEC/FSE 2023) 12 pages (pdf)

Hermit: Low-Latency, High-Throughput, and Transparent Remote Memory via Feedback-Directed Asynchrony, Yifan Qiao and Chenxi Wang, Zhenyuan Ruan and Adam Belay, Qingda Lu, Yiying Zhang, Miryung Kim and Guoqing Harry Xu (NSDI '23) (pdf)

Canvas: Isolated and Adaptive Swapping for Multi-Applications on Remote Memory, Chenxi Wang, Yifan Qiao, Haoran Ma, Shi Liu, Yiying Zhang, Wenguang Chen, Ravi Netravali, Miryung Kim, Guoqing Harry Xu. 18 pages, NSDI '23 (pdf)

A Declarative Metamorphic Testing Framework for Autonomous Driving, Yao Deng, Xi Zheng, Guannan Lou, Tianyi Zhang, Huai Liu, Miryung Kim, Tsong Chen, 18 pages, IEEE TSE 2023 (doi)

2022

HeteroGen: Transpiling C to Heterogeneous HLS Code with Automated Test Generation and Program Repair, Qian Zhang, Jiyuan Wang, Harry Xu, Miryung Kim, 27th International Conference on Architectural Support for Programming Languages and Operating Systems, 12 pages, ASPLOS '22 (local pdf, slides, Github HeteroGen)

Mako: A Low-Pause, High-Throughput Evacuating Collector for Memory-Disaggregated Datacenters, H. Ma, S. Liu, C. Wang, Y. Qiao, M. Bond, S. Blackburn, M. Kim, G. Xu, PLDI '22 (local_pdf, slides, Github_Mako)

Concept-Annotated Examples for Library Comparison, Litao Yan, Miryung Kim, Björn Hartmann, Tianyi Zhang, Elena L. Glassman, ACM Symposium on User Interface Software and Technology, UIST '22, (pdf)

Sibylvariant Transformations for Robust Text Classification, Fabrice Y Harel-Canada, Muhammad Ali Gulzar, Nanyun Peng, Miryung Kim. Findings of ACL '22 (local_pdf, slides, Github_Sibyl, video, doi)

2021

QDiff: Differential Testing of Quantum Software Stacks, Jiyuan Wang, Qian Zhang, Guoqing Harry Xu, Miryung Kim, The 36th IEEE/ACM International Conference on Automated Software Engineering, 12 pages, ASE '21 (local_pdf, presentation video, GitHub-Qdiff), featured in ACM SIGSOFT Research Highlights in 2022 (link)
 
Fuzz Testing to Detect Platform Dependent Divergence for Heterogeneous Applications, Qian Zhang, Jiyuan Wang, Miryung Kim, 29th ACM Joint Meeting on European Software Engineering Conference and Symposium on the Foundations of Software Engineering, 12 pages, ESEC/FSE '21 (DOI, local_pdf, presentation video, GitHub-HeteroFuzz)

Dorylus: Affordable, Scalable, and Accurate GNN Training over Billion-Edge Graphs, John Thorpe, Yifan Qiao, Jonathan Eyolfson, Shen Teng, Guanzhou Hu, Zhihao Jia, Jinliang Wei, Keval Vora, Ravi Netravali, Miryung Kim, Guoqing Harry Xu, OSDI '21 (DOI, presentation, GitHub-Dorylus)

OptDebug: Fault-Inducing Operation Isolation for Dataflow Applications, Muhammad Ali Gulzar, Miryung Kim, ACM Symposium on Cloud Computing 2021, 12 pages, SoCC '21 (local_pdf, DOI, slides). 

Efficient Fuzz Testing for Apache Spark Using Framework Abstraction, Qian Zhang, Jiyuan Wang, Muhammad Ali Gulzar, Rohan Padhye, Miryung Kim, ICSE '21 Tool Demonstration, 4 pages (DOI, local_pdf, video, GitHub-BigFuzz)

2020

Talk slides: ICSME's 10-Year Retrospective Most Influential Paper Award, Template-based reconstruction of complex refactorings, the 36th IEEE International Conference on Software Maintenance and Evolution, ICSME 2020 (youtube, slides)

Semeru: A Memory-Disaggregated Managed Runtime,  Chenxi Wang, Haoran Ma, Shi Liu, Yuanqi Li, Zhenyuan Ruan, Khanh Nguyen, Michael Bond, Ravi Netravali, Miryung Kim, Guoqing Harry Xu, The 14th USENIX Symposium on Operating Systems Design and Implementation, 12 pages (paper and slide link, GitHub-Semeru), OSDI '20 
 
Influence-Based Provenance for Dataflow Applications with Taint Propagation, Jason Teoh, Muhammad Ali Gulzar, Miryung Kim, ACM Symposium on Cloud Computing 2020, 12 pages, SoCC '20 (local_pdf, video, DOI, GitHub-FlowDebug)

BigFuzz: Efficient Fuzz Testing for Data Analytics using Framework Abstraction, Qian Zhang, Jiyuan Wang, Muhammad Ali Gulzar, Rohan Padhye, Miryung Kim The 35th IEEE/ACM International Conference on Automated Software Engineering, 12 pages, ASE '20 (local_pdf, slides, code and replication_package, video, GitHub-BigFuzz)

Is Neuron Coverage a Meaningful Measure for Testing Deep Neural Networks? Fabrice Harel-Canada, Lingxiao Wang, Muhammad Ali Gulzar, Quanquan Gu, Miryung Kim, ACM SIGSOFT International Symposium on the Foundations of Software Engineering, ESEC/FSE '20 (local_pdf, replication_package, DOI, video, GitHub-DeepLearningTest)

JShrink: In-depth Investigation into Debloating Modern Java Applications, Bobby Bruce*, Tianyi Zhang*, Jaspreet Arora, Guoqing Harry Xu, Miryung Kim, ACM SIGSOFT International Symposium on the Foundations of Software Engineering, 12 pages, ESEC/FSE '20 (local_pdf, replication package, web tutorial, jdebloat ONR software security summer school, GitHub-JShrink) * are equal co-first authors

HeteroRefactor: Refactoring for Heterogeneous Computing with FPGA, Jason Lau*, Aishwarya Sivaraman*, Qian Zhang*, Muhammad Ali Gulzar, Jason Cong, Miryung Kim [* are equal co-first authors, ordered alphabetically by their last names.] Proceedings of 42nd IEEE/ACM International Conference on Software Engineering, 13 pages, ICSE ’20 (local pdf, slides, video, replication package, GitHub-HeteroRefactor)

Enabling Data-Driven API Design with Community Usage Data: A Need-Finding Study, Tianyi Zhang, Björn Hartmann, Miryung Kim, Elena Glassman, 2020 ACM Conference on Human Factors in Computing Systems, 12 pages, CHI ’20 (local pdf, slides, video)

An Analysis of Adversarial Attacks and Defenses on Autonomous Driving Models, 18th Annual IEEE International Conference on Pervasive Computing and Communications 2020, Yao Deng, Xi Zheng, Tianyi Zhang, Chen Chen, Guannan Lou, Miryung Kim, 10 pages, PerCom '20 (local pdf)

Characterizing and Identifying Composite Refactorings: Concepts, Heuristics and Patterns,  Leonardo Da Silva Sousa, Diego Cedrim, Alessandro Garcia, Willian Oizumi, Ana Carla Bibiano, Daniel Oliveira, Miryung Kim, Anderson Oliveira, Mining Software Repositories 2020, 12 pages, MSR'20 (local pdf)

SE4DA: Software Engineering for Data Analytics, Pre-print for Special Issue on IEEE Software, The AI Effect: Working at the Intersection of AI and Software Engineering, Miryung Kim, IEEE Software (doi, local_pdf)

WebJShrink: A Web Service for Debloating Java Bytecode, Konner Macias, Mihir Mathur, Bobby R. Bruce, Tianyi Zhang, Miryung Kim, ACM SIGSOFT International Symposium on the Foundations of Software Engineering, 12 pages, ESEC/FSE '20 Tool Demonstration (video, pdf, DOI, GitHub-WebJShrink)

BigTest: Symbolic Execution Based Systematic Test Generation Tool for Apache Spark, Muhammad Ali Gulzar, Madan Musuvathi, Miryung Kim, Proceedings of 42nd IEEE/ACM International Conference on Software Engineering, 4 pages, ICSE ’20 Tool Demonstration (local_pdf, presentation video, code, demo)

Muhammad Ali Gulzar's PhD Dissertation: Automated Testing and Debugging for Big Data Analytics (local_pdf, UCLA)

2019

PerfDebug: Performance Debugging of Computation Skew in Dataflow Systems, Jason Teoh, Muhammad Ali Gulzar, Guoqing Harry Xu, and Miryung Kim, SoCC '19: ACM Symposium on Cloud Computing 2019. 14 pages (local_pdf, slides, GitHub-PerfDebug)

White-Box Testing of Big Data Analytics with Complex User-Defined Functions, Muhammad Ali Gulzar, Shaghayegh Mardani, Madanlal Musuvathi, Miryung Kim, ESEC/FSE '19: The 27th ACM Joint European Software Engineering Conference and Symposium on the Foundations of Software Engineering (local_pdf, DOI, data set and tool, GitHub-BigTest) 12 pages

Analyzing and Supporting Adaptation of Online Code Examples, Tianyi Zhang, Di Yang, Cristina Lopes, Miryung Kim, ICSE '19: Proceedings of 41st IEEE/ACM International Conference on Software Engineering, 12 pages (local_pdf, slides, DOI, data set and tool, GitHub-ExampleStack)

Active Inductive Logic Programming for Code Search, Aishwarya Sivaraman, Tianyi Zhang, Guy Van den Broeck, Miryung Kim, ICSE '19: Proceedings of 41st IEEE/ACM International Conference on Software Engineering, 12 pages (local_pdf, slides, DOI, data set and tool, GitHub-ALICE)

An Empirical Study of Common Challenges in Developing Deep Learning Applications, Tianyi Zhang, Cuiyun Gao, Lei Ma, Michael R. Lyu and Miryung Kim, ISSRE'19: Proceedings of 30th International Symposium on Software Reliability Engineering, 12 pages. (local pdf)

Gerenuk: Thin Computation over Big Native Data Using Speculative Program Transformation, Cheng Cai, Christian Navasca, Khanh Nguyen, Brian Demsky, Shan Lu, Miryung Kim, Guoqing Harry Xu, SOSP '19: The 27th ACM Symposium on Operating Systems Principles, 12 pages (local pdf, DOI)

Book Chapter on Software Evolution, Miryung Kim, Na Meng, and Tianyi Zhang, Handbook of Software Engineering, Editors: Cha, Taylor, Kang, Springer, pages 223-284 (local_pdf, DOI)

Tianyi Zhang's PhD Dissertation: Leveraging Program Commonalities and Variations for Systematic Software Development and Maintenance (local_pdf, UCLA)

2018

Are Code Examples on an Online Q&A Forum Reliable? A Study of API Misuse on Stack Overflow, Tianyi Zhang, Ganesha Upadhyaya, Anastasia Reinhardt, Hridesh Rajan and Miryung Kim, ICSE '18: Proceedings of 40th IEEE/ACM International Conference on Software Engineering, pages  886-896, (local_pdf, slides, DOI, GitHub-ExampleCheck)

Visualizing API Usage Examples at Scale, Elena L. Glassman*, Tianyi Zhang*, Björn Hartmann, Miryung Kim, CHI '18: Proceedings of ACM CHI Conference on Human Factors in Computing Systems, 12 pages, Paper No. 580, (local_pdf, demo, slides, DOI, demo video). *The two lead authors contributed equally to the work, as part of an equal collaboration between the two institutions.

BigSift: Automated Debugging of Big Data Analytics in Data-Intensive Scalable Computing, Muhammad Ali Gulzar, Siman Wang, Miryung Kim, ESEC/FSE '18 Research Demonstrations, pages 863-866 (local_pdf, demo video, slides, DOI, GitHub-BigSift)

Augmenting Stack Overflow with API Usage Patterns Mined from GitHub, Anastasia Reinhardt, Tianyi Zhang, Mihir Mathur, Miryung Kim, ESEC/FSE '18 Research Demonstrations, pages 880-883 (local_pdf, demo video, slides, DOI, GitHub-ExampleCheck)

2017

Data Scientists in Software Teams: State of the Art and Challenges, Miryung Kim, Thomas Zimmermann, Robert DeLine, and Andrew Begel, TSE: IEEE Transactions on Software Engineering, 16 pages (Journal First, Accepted, August 2017) (local_pdf, slides). A journal first presentation at ICSE 2018

Automated Debugging in Data-Intensive Scalable Computing, Muhammad Ali Gulzar, Matteo Interlandi, Xueyuan Han, Mingda Li, Tyson Condie, Miryung Kim, SoCC '17: ACM Symposium on Cloud Computing 2017, pages 520-534 (pdf, slides, GitHub-BigSift)

Automated Transplantation and Differential Testing for Clones, Tianyi Zhang, Miryung Kim, ICSE '17: Proceedings of 39th IEEE/ACM International Conference on Software Engineering, pages 665-676 (DOI) (local_pdf) (slides)

Refactoring Inspection Support for Manual Refactoring Edits, Everton L.G. Alves, Myoungkyu Song, Tiago Massoni, Patricia D. L. Machado, Miryung Kim, TSE: IEEE Transactions on Software Engineering, 20 pages (Accepted, March 2017) (DOI, local_pdf, GitHub-RefDistiller)

Debugging Big Data Analytics in Spark with BigDebug, Muhammad Ali Gulzar, Matteo Interlandi, Tyson Condie, Miryung Kim, SIGMOD '17: Proceedings of The 2017 ACM SIGMOD/PODS Conference, Demonstration Track, pages 1627-1630, 2016, (DOI) (local_pdf) (demo video) (GitHub-BigDebug)

Adding Data Provenance Support to Apache Spark, Matteo Interlandi, Ari Ekmekji, Kshitij Shah, Muhammad Ali Gulzar, Sai Deep Tetali, Miryung Kim, Todd Millstein, Tyson Condie, VLDB Journal (Accepted, July 2017) (DOI) Special issue on best papers of VLDB 2016 (GitHub-Titian)

Appendix to Everything You Wanted to Know About Data Scientists, Miryung Kim, Thomas Zimmermann, Robert DeLine, and Andrew Begel, (Preliminary Version, July 2017) (local_pdf)

2016

BigDebug: Debugging Primitives for Interactive Big Data Processing in Spark, Muhammad Ali Gulzar, Matteo Interlandi, Seunghyun Yoo, Sai Deep Tetali, Tyson Condie, Todd Millstein, Miryung Kim, ICSE '16: Proceedings of 38th IEEE/ACM International Conference on Software Engineering, pages 784-795 (DOI) (local_pdf) (slides) (GitHub-BigDebug)

The Emerging Role of Data Scientists on Software Development Teams, Miryung Kim, Thomas Zimmermann, Robert DeLine, Andrew Begel, ICSE '16: Proceedings of 38th IEEE/ACM International Conference on Software Engineering, pages 96-107 (DOI) (local_pdf) (slides)

Appendix to The Emerging Role of Data Scientists on Software Development Teams (local_pdf) (link)

Titian: Data Provenance Support in Spark, Matteo Interlandi, Kshitij Shah, Sai Tetali, Muhammad  Gulzar, Seunghyun Yoo, Miryung Kim, Todd Millstein, Tyson Condie, VLDB '16 (PVLDB Volume 9 Issue 3): Proceedings of the 42nd Conference on Very Large Data Bases. pages 216-227, (DOI) (local_pdf) (GitHub-Titian)

BigDebug: Interactive Debugger for Big Data Analytics in Apache Spark, Muhammad Ali Gulzar, Matteo Interlandi, Tyson Condie, Miryung Kim, FSE '16: The 24th ACM SIGSOFT International Symposium on the Foundations of Software Engineering, Demonstration Track, pages 1033-1037 (DOI) (local_pdf) (extended version with screenshots) (demo video) (poster) (slides) (GitHub-BigDebug)

Optimizing Interactive Development of Data-Intensive Applications, Matteo Interlandi, Sai Deep Tetali, Muhammad Ali Gulzar, Joseph Noor, Tyson Condie, Miryung Kim, Todd D. Millstein, SoCC '16: ACM Symposium on Cloud Computing 2016, pages 510-522 (DOI) (local_pdf)

Interactive Debugging for Big Data Analytics, Muhammad Ali Gulzar, Xueyuan Han, Matteo Interlandi, Shaghayegh Mardani, Sai Deep Tetali, Tyson Condie, Todd Millstein, Miryung Kim, HotCloud 2016, The 8th USENIX Workshop on Hot Topics in Cloud Computing, 5 pages (DOI) (local_pdf)

Trusted Software Repair for System Resiliency, Westley Weimer, Stephanie Forrest, Miryung Kim, Claire Le Goues, Patrick Hurley, DSN '16: The 46th IEEE/IFIP International Conference on Dependable Systems and Networks, Industry Track, pages 238-241 (DOI) (local_pdf)

An Empirical Study of Supplementary Patches in Open Source Projects, Jihun Park, Miryung Kim, Doo-Hwan Bae, Journal of Empirical Software Engineering, 22(1): pages 436-473 (2017) Preprint DOI: 10.1007/s10664-016-9432-x (Date of Publication: 07 May 2016) (DOI) (local_pdf)

Prioritizing Test Cases for Early Detection of Refactoring Faults, Everton L.G. Alves, Patricia D. L. Machado, Tiago Massoni, Miryung Kim, Journal of Software Testing, Verification, and Reliability, 2016, Preprint DOI: 10.1002/stvr.1603 (Date of Publication: 21 March 2016), Volume 26, Issue 5, pages 402-426, August 2016 (DOI) (local_pdf)

Perceptions on the State of the Art in Verification and Validation in Cyber-Physical Systems, Xi Zheng, Christine Julien, Miryung Kim, Sarfraz Khurshid, IEEE Systems Journal, Preprint DOI: 10.1109/JSYST.2015.2496293 (Date of Publication: 20 November 2015), 14 pages (DOI) (local_pdf)

2015

Does Automated Refactoring Obviate Systematic Editing? Na Meng, Lisa Hua, Miryung Kim, and Kathryn McKinley, ICSE '15: Proceedings of 37th IEEE/ACM International Conference on Software Engineering, pages 392-402 (DOI) (local pdf) (slides)

Interactive Code Review for Systematic Changes, Tianyi Zhang, Myoungkyu Song, Joseph Pinedo, Miryung Kim, ICSE '15: Proceedings of 37th IEEE/ACM International Conference on Software Engineering, pages 111-122 (DOI) (local pdf) (slides)

2014

An Empirical Study of Refactoring Challenges and Benefits at Microsoft, Miryung Kim, Thomas Zimmermann, and Nachiappan Nagappan, TSE: IEEE Transactions on Software Engineering, Volume 40 No. 7: 633-649 (2014) (DOI) (local_pdf)

Cookbook: In Situ Code Completion Using Edit Recipes Learned from Examples, John Jacobellis, Na Meng, Miryung Kim, ICSE '14: Proceedings of 36th IEEE/ACM International Conference on Software Engineering, Formal Demonstration Track, pages 584-587 (link) (local_pdf) (slides) (demo)

An empirical study on reducing omission errors in practice. Jihun Park, Miryung Kim, Doo-Hwan Bae, ASE '14: Proceedings of the 29th ACM/IEEE international conference on Automated software engineering, pages 121-126 (DOI) (local_pdf) (slides)

RefDistiller: a refactoring aware code review tool for inspecting manual refactoring edits. Everton L. G. Alves, Myoungkyu Song, Miryung Kim, FSE '14: Proceedings of the 22nd ACM SIGSOFT International Symposium on Foundations of Software Engineering, Research Demonstration Track, pages 751-754 (DOI) (poster) (local_pdf)

Critics: an interactive code review tool for searching and inspecting systematic changes,  Tianyi Zhang, Myoungkyu Song, Miryung Kim, FSE '14: Proceedings of the 22nd ACM SIGSOFT International Symposium on Foundations of Software Engineering, Research Demonstration Track, pages 755-758 (DOI) (poster) (local_pdf)

Recommending Program Transformations to Automate Repetitive Software Changes, Book Chapter on Recommendation Systems on Software Engineering. Miryung Kim, Na Meng, Editors: Martin Robillard, Walid Maalej, Robert Walker and Thomas Zimmermann, Springer (link) (local_pdf)

Refdistiller: Detecting anomalies in manual refactoring edits. E. L. Alves, M. Song, M. Kim, P. D. Machado, and T. Massoni. Technical report, University of Texas at Austin, TR-ECE-2014-3, March, 2014. (local_pdf)

Experimental Artifacts for "RefDistiller: Detecting Anomalies in Manual Refactoring Edits," Everton L. G. Alves, Myoungkyu Song, Miryung Kim, Patricia Machado, Tiago Massoni, Technical Report (local pdf)

2013

LASE: Locating and Applying Systematic Edits by Learning from Examples, Na Meng, Miryung Kim, Kathryn McKinley, ICSE '13: Proceedings of 35th IEEE/ACM International Conference on Software Engineering, pages 502-511 (DOI) (local pdf) (slides)

LASE: An Example-based Program Transformation Tool for Locating and Applying Systematic Edits, John Jacobellis, Na Meng, Miryung Kim, ICSE '13: Proceedings of 35th IEEE/ACM International Conference on Software Engineering, 4 pages, Formal Demonstrations Track, pages 1319-1322 (DOI) (local pdf) (demo video)

Identifying and Summarizing Systematic Code Changes via Rule Inference, Miryung Kim, David Notkin, Dan Grossman, Gary Wilson Jr. TSE: IEEE Transactions on Software Engineering, Volume 39 No 1. pages 45-62 (DOI) (local pdf).

Detecting and Characterizing Semantic Inconsistencies in Ported Code, Baishakhi Ray, Miryung Kim, Suzette Person, Neha Rungta, ASE '13: Proceedings of The 28th IEEE/ACM International Conference on Automated Software Engineering, pages 367-377, (DOI) (local pdf) (slides)

An Empirical Study of API Stability and Adoption in the Android Ecosystem, Tyler McDonnell, Baishakhi Ray, Miryung Kim, ICSM '13: Proceedings of 29th IEEE International Conference on Software Maintenance, pages 70-79, (DOI) (local pdf) (slides)

FaultTracer: A Spectrum-Based Approach to Localizing Failure-Inducing Program Edits, Lingming Zhang, Miryung Kim, Sarfraz Khurshid, Journal of Software: Evolution and Process, 25(12): 1357-1383 (2013) (DOI)

2012

A Case Study of Cross-System Porting in Forked Software Projects, Baishakhi Ray, Miryung Kim, FSE '12: ACM SIGSOFT the 20th International Symposium on the Foundations of Software Engineering, Article 53, 11 pages, (DOI) (local pdf) (slides).

A Field Study of Refactoring Challenges and Benefits, Miryung Kim, Thomas Zimmermann, Nachiappan Nagappan, FSE '12: ACM SIGSOFT the 20th International Symposium on the Foundations of Software Engineering, Article 50, 11 pages, (DOI) (local pdf) (slides).

Repertoire: A Cross-System Porting Analysis Tool for Forked Software Projects, Baishakhi Ray, Christopher Wiley and Miryung Kim FSE '12: ACM SIGSOFT the 20th International Symposium on the Foundations of Software Engineering, Formal Research Tool Demonstration, Article 8, 4 pages, (DOI) (local pdf) (slides)

FaultTracer: A Change Impact and Regression Fault Analysis Tool for Evolving Java Programs, Lingming Zhang, Miryung Kim and Sarfraz Khurshid, FSE '12: ACM SIGSOFT the 20th International Symposium on the Foundations of Software Engineering, Formal Research Tool Demonstration, Article 40, 4 pages (DOI) (local pdf) (slides)

An Empirical Investigation into the Impact of Refactoring on Regression Testing, Napol Rachatasumrit, Miryung Kim, ICSM '12: the 28th IEEE International Conference on Software Maintenance, pages 357-366, (DOI) (local pdf) (slides)

Specification-based Test Repair Using a Lightweight Formal Method, Guowei Yang, Sarfraz Khurshid, and Miryung Kim, FM '12: Proceedings of the 18th International Symposium on Formal Methods, pages 455-470, (local pdf) (DOI)

Vdiff: A Program Differencing Algorithm for Verilog Hardware Description Language, Adam Duley, Christopher Spandikow, Miryung Kim, Journal of Automated Software Engineering, An Invited Article from the 25th IEEE/ACM International Conference on Automated Software Engineering (ASE), pages 459-490, (DOI) (local pdf).

An Empirical Study of Supplementary Bug Fixes, Jihun Park, Miryung Kim, Baishakhi Ray, Doo-Hwan Bae, MSR '12: The 9th IEEE Working Conference on Mining Software Repositories, pages 40-49, (DOI) (local pdf) (slides). Invited to the Special Issue of Journal of Empirical Software Engineering (EMSE).

Brace: An Assertion Framework for Debugging Cyber-Physical Systems, Kevin Boos, Chien Liang Fok, Christine Julien, Miryung Kim, ICSE '12: Proceedings of 2012 International Conference on Software Engineering, 4 pages, New Ideas and Emerging Results (NIER) Track, pages 1341-1344, (local pdf) (DOI). 

Appendix to A Field Study of Refactoring Rationale, Benefits, and Challenges at Microsoft. Miryung Kim, Thomas Zimmermann, and Nachiappan Nagappan, Technical Report. MSR-TR2012-4. (pdf hosted at Microsoft Research)

A Graph-based Framework for Reasoning about Relationships among Software Modifications, Ripon Saha, Rui Qiu, Miryung Kim, Dewayne Perry, Technical Report, UT-TR2012-June (pdf)

2011

Localizing Fault-Inducing Program Edits Based on Spectrum Information, Lingming Zhang, Miryung Kim, Sarfraz Khurshid, ICSM '11: Proceedings of the 27th IEEE International Conference on Software Maintenance, pages 23-32, DOI (local pdf). Invited to the Special Issue of Journal of Software Maintenance and Evolution.

Sydit: Creating and Applying a Program Transformation from an Example. Na Meng, Miryung Kim, Kathryn S. McKinley, ESEC/FSE '11: Proceedings of the 19th ACM SIGSOFT Symposium on the Foundations of Software Engineering (FSE-19) and ESEC'11, Formal Research Demonstration, pages 440-443, DOI (local pdf).

Systematic Editing: Generating Program Transformations from an Example, Na Meng, Miryung Kim, Kathryn S. McKinley, PLDI '11: Proceedings of the 32nd ACM SIGPLAN conference on Programming Language Design and Implementation, pages 329-342, DOI (local pdf) (slides)

An Empirical Investigation into the Role of API-Level Refactoring during Software Evolution, Miryung Kim, Dongxiang Cai, Sunghun Kim, ICSE '11: Proceedings of the 2011 ACM and IEEE 33rd International Conference on Software Engineering, pages 151-160, DOI (local pdf) (presentation). Nominated for ACM SIGSOFT Distinguished Paper Award.
 
Detecting Software Modularity Violations, Sunny Wong, Yuanfang Cai, Miryung Kim, Michael Dalton, ICSE '11: Proceedings of the 2011 ACM and IEEE 33rd International Conference on Software Engineering, pages 411-420, DOI (local pdf) (presentation)

An Empirical Study of Long-Lived Code Clones, Dongxiang Cai and Miryung Kim, FASE '11: Proceedings of the Fundamental Approaches to Software Engineering, pages 432-446, DOI (local pdf) (presentation)

An Exploratory Study of Awareness Interests about Software Modifications, Miryung Kim, CHASE '11: 4th International Workshop on Cooperative and Human Aspects of Software Engineering, co-located with 2011 ACM and IEEE 33rd International Conference on Software Engineering, pages 80-83, DOI (local pdf)

2010

Ref-Finder: a Refactoring Reconstruction Tool based on Logic Query Templates, Miryung Kim, Matthew Gee, Alex Loh, and Napol Rachatasumrit, FSE '10: Proceedings of the 18th ACM SIGSOFT Symposium on the Foundations of Software Engineering, Pages 371-372, Publisher: ACM DOI, Formal Research Demonstration (local pdf)

Validity Concerns in Software Engineering Research, Hyrum K. Wright, Miryung Kim, and Dewayne Perry, FoSeR '10: FSE/SDP Workshop on the Future of Software Engineering Research, Pages: 411-414, Publisher: ACM DOI

Managing Technical Debt in Software-Reliant Systems,  Nanette Brown, Yuanfang Cai, Yuepu Guo, Rick Kazman, Miryung Kim, Philippe Kruchten, Erin Lim, Alan MacCormack, Robert Nord, Ipek Ozkaya, Raghvinder Sangwan, Carolyn Seaman, Kevin Sullivan, Nico Zazworka,  FoSeR '10: FSE/SDP Workshop on the Future of Software Engineering Research, Pages: 47-52, Publisher: ACM DOI 

A Catalogue of Template Refactoring Rules, Kyle Prete, Napol Rachatasumrit, Miryung Kim,  Technical Report, Department of Electrical and Computer Engineering, August 15th, 2010, pdf

Template-based Reconstruction of Complex Refactorings, Kyle Prete, Napol Rachatasumrit, Nikita Sudan, and Miryung Kim, ICSM '10: Proceedings of the 26th IEEE International Conference on Software Maintenance, Pages 1-10,  Publisher: IEEE DOI, presentation (local pdf)

A Graph-based Approach to API Usage Adaptation, Hoan Anh Nguyen, Tung Thanh Nguyen, Gary Wilson Jr., Anh Tuan Nguyen, Miryung Kim, Tien Nguyen, OOPSLA '10: Proceedings of the 2010 ACM SIGPLAN International Conference on Systems, Programming, Languages and Applications, Pages: 302-321, Publisher: ACM DOI, presentation (local pdf)

A Program Differencing Algorithm for Verilog HDL, Adam Duley, Chris Spandikow, Miryung Kim, ASE '10: Proceedings of the 25th IEEE/ACM International Conference on Automated Software Engineering, Pages: 477-486, Publisher: ACM DOI, presentation, ACM SIGSOFT Distinguished Paper Award (local pdf)

AURA: A Hybrid Approach to Identify Framework Evolution, Wei Wu, Yann-Gael Gueheneuc, Giuliano Antoniol, Miryung Kim,  ICSE '10: Proceedings of the 2010 ACM and IEEE 32nd International Conference on Software Engineering, Pages: Volume 1, 325-334, Publisher: ACM DOI (local pdf)

LSdiff: A Program Differencing Tool to Identify Systematic Structural Differences, Alex Loh and Miryung Kim, ICSE '10: Proceedings of the 2010 ACM and IEEE 32nd International Conference on Software Engineering, Pages: Volume 2, 263-266, Publisher: ACM DOI Formal Research Demonstrations (local pdf)

2009

Discovering and Representing Systematic Code Changes, Miryung Kim and David Notkin, ICSE '09: Proceedings of the 2009 IEEE 31st International Conference on Software Engineering, Pages 309-319, Publisher: IEEE DOI Presentation (local pdf)

2008

Analyzing and Inferring the Structure of Code Changes, Miryung Kim, Ph.D. Dissertation, University of Washington, November 2008 Pdf, Presentation

2007

Automatic Inference of Structural Changes for Matching across Program Versions, Miryung Kim, David Notkin, Dan Grossman, ICSE '07: Proceedings of the 29th international conference on Software Engineering, pages 333-343, Publisher: IEEE DOI, Presentation (local pdf)

SoftGUESS: Visualization and Exploration of Code Clones in Context, Eytan Adar, Miryung Kim, ICSE '07: Proceedings of the 29th international conference on Software Engineering, pages 762-766, Publisher: IEEE DOI

Understanding and Aiding Code Evolution by Inferring Change Patterns, Miryung Kim, ICSE '07: Companion to the proceedings of the 29th International Conference on Software Engineering, pages 101-102, Publisher: IEEE DOI, Presentation

2006

Program element matching for multi-version program analyses, Miryung Kim, David Notkin, MSR '06: Proceedings of the 2006 international workshop on Mining software repositories, pages 58-64, Publisher: ACM DOI, Presentation (local pdf)

TA-RE: an exchange language for mining software repositories, Sunghun Kim, Thomas Zimmermann, Miryung Kim, Ahmed Hassan, Audris Mockus, Tudor Girba, Martin Pinzger, E. James Whitehead, Jr., Andreas Zeller, MSR '06: Proceedings of the 2006 international workshop on Mining software repositories, pages 22-25, Publisher: ACM DOI 

2005

An empirical study of code clone genealogies, Miryung Kim, Vibha Sazawal, David Notkin, Gail Murphy, ESEC/FSE '05: Proceedings of the 10th European software engineering conference held jointly with 13th ACM SIGSOFT international symposium on Foundations of software engineering, pages 187-196, Publisher: ACM DOI, Presentation (local pdf). Nominated for ACM SIGSOFT Distinguished Paper Award.
   
Using a clone genealogy extractor for understanding and supporting evolution of code clones, Miryung Kim, David Notkin, MSR '05: Proceedings of the 2005 international workshop on Mining software repositories, pages 1-5, Publisher: ACM DOI (local pdf)

2004

A Study of Evolution in the Presence of Source-Derived Partial Design Representations, Vibha Sazawal, Miryung Kim, David Notkin, IWPSE '04: Proceedings of the 7th International Workshop on Principles of Software Evolution, pages 83-92, Publisher: IEEE DOI (local pdf)
 
An Ethnographic Study of Copy and Paste Programming Practices in OOPL, Miryung Kim, Lawrence Bergman, Tessa Lau, David Notkin, ISESE '04: Proceedings of the 2004 International Symposium on Empirical Software Engineering, pages 21-30, Publisher: IEEE DOI, presentation (local pdf)

Footnote on authorship. [Since people keep asking me about this, I will clarify: before moving to UCLA, I put my name first among senior authors if I was the lead senior author. After moving to UCLA, I put my name as the last author if I am the lead senior author. The bottom line is I am thankful for the opportunity to collaborate with brilliant minds and produce awesome research together.]