Incorporating qualitative information for credit risk assessment through frequent subtree mining for XML

Novita Ikasari, Fedja Hadzic, Tharam S. Dillon

Research output: Chapter in Book/Report/Conference proceedingChapterpeer-review


Credit risk assessment has been one of the most appealing topics in banking and finance studies, attracting both scholars' and practitioners' attention for some time. Following the success of the Grameen Bank, works on credit risk, in particular for Small Medium Enterprises (SMEs), have become essential. The distinctive character of SMEs requires a method that takes into account quantitative and qualitative information for loan granting decision purposes. In this chapter, we first provide a survey of existing credit risk assessment methods, which shows a current gap in the existing research in regards to taking qualitative information into account during the data mining process. To address this shortcoming, we propose a framework that utilizes an XML-based template to capture both qualitative and quantitative information in this domain. By representing this information in a domain-oriented way, the potential knowledge that can be discovered for evidence-based decision support will be maximized. An XML document can be effectively represented as a rooted ordered labelled tree and a number of tree mining methods exist that enable the efficient discovery of associations among tree-structured data objects, taking both the content and structure into account. The guidelines for correct and effective application of such methods are provided in order to gain detailed insight into the information governing the decision making process.

Original languageEnglish
Title of host publicationSmall and Medium Enterprises
Subtitle of host publicationConcepts, Methodologies, Tools, and Applications
PublisherIGI Global
Number of pages38
ISBN (Electronic)9781466638877
ISBN (Print)1466638869, 9781466638860
Publication statusPublished - 30 Apr 2013


Dive into the research topics of 'Incorporating qualitative information for credit risk assessment through frequent subtree mining for XML'. Together they form a unique fingerprint.

Cite this