Title:

AMAP: Automatically Mining Abbreviation Expansions in Programs to Enhance Software Maintenance Tools

Authors:

Emily Hill, Zachary P. Fry, Haley Boyd, Giriprasad Sridhara, Yana Novikova, Lori Pollock, and K. Vijay-Shanker

Abstract:

When writing software, developers often employ abbreviations in identifier names. In fact, some abbreviations may never occur with the expanded word, or occur more often in the code. However, most existing program comprehension and search tools do little to address the problem of abbreviations, and therefore may miss meaningful pieces of code or relationships between software artifacts. In this paper, we present an automated approach to mining abbreviation expansions from source code to enhance software maintenance tools that utilize natural language information. Our scoped approach uses contextual information at the method, program, and general software level to automatically select the most appropriate expansion for a given abbreviation. We evaluated our approach on a set of 250 potential abbreviations and found that our scoped approach provides a 57% improvement in accuracy over the current state of the art.

Book Title:

MSR 2008: 5th Working Conference on Mining Software Repositories

Date:

May 2008

Project:

Natural Language Program Analysis

Document Type:

Conference Proceedings

Key Words:

Automatic abbreviation expansion, software maintenance, program comprehension, software tools

Files:

[presentation slides: Adobe PDF] (1020 KB)
[preprint: Adobe PDF] (180 KB)

Bibtex Entry:

@inproceedings{123456789/187,
author = {Emily Hill and Zachary P. Fry and Haley Boyd and Giriprasad Sridhara and Yana Novikova and Lori Pollock and K. Vijay-Shanker},
title = {AMAP: Automatically Mining Abbreviation Expansions in Programs to Enhance Software Maintenance Tools},
booktitle = {MSR 2008: 5th Working Conference on Mining Software Repositories},
month = {May},
year = {2008}
}