Source author record

Dhanya Jothimani

Dhanya Jothimani appears in the imported research catalog. Authorship, coauthor and topic links are available while profile ownership is still unclaimed.

Researcher (unclaimed source record)

Machine Learning, q-fin.ST, Software Engineering

Catalog footprint

What is connected

2 works

3 topics

4 close collaborators

preprint, 2021, arXiv

Does chronology matter in JIT defect prediction? A Partial Replication Study

Just-In-Time (JIT) models detect fix-inducing (or defect-inducing) changes. These models are designed on the assumption that the properties of past code changes are similar to those of future ones. However, as a system evolves, the expertise of its developers and/or the complexity of the system also changes. In this work, we investigate the effect of code change properties on JIT models over time. We also study how using only recent data, as opposed to all available data, affects the performance of JIT models. Further, we analyze the effect of weighted sampling on the performance of fix-inducing properties of JIT models. For this purpose, we used datasets from Eclipse JDT, Mozilla, Eclipse Platform, and PostgreSQL. We used five families of code change properties: size, diffusion, history, experience, and purpose. We used Random Forest to train and test the JIT model, and the Brier score and the area under the ROC curve (AUC) to measure performance. Our results suggest that the predictive power of JIT models does not change over time. Furthermore, we observed that the chronology of data in JIT defect prediction models can be discarded by considering all the available data. On the other hand, the importance scores of the families of code change properties are found to oscillate over time. To mitigate the impact of the evolution of code change properties, it is recommended to use a weighted sampling approach that places more emphasis on changes occurring closer to the current time. Moreover, since properties such as "Expertise of the Developer" and "Size" evolve with time, models obtained from old data may exhibit different characteristics from those trained on newer data. Hence, practitioners should constantly retrain JIT models to include fresh data.
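The two performance measures and the recency-weighted sampling idea named in the abstract can be illustrated with a minimal standard-library sketch. This is not the authors' pipeline: the exponential decay, the `half_life_days` parameter, the function names, and the toy labels and probabilities are all assumptions made for illustration, and the Random Forest step is left out.

```python
import random


def brier_score(y_true, y_prob):
    """Mean squared gap between predicted probability and the 0/1 outcome."""
    return sum((p - y) ** 2 for y, p in zip(y_true, y_prob)) / len(y_true)


def roc_auc(y_true, y_score):
    """AUC as the chance that a random positive outranks a random negative.

    Ties between a positive and a negative score count as half a win.
    """
    pos = [s for y, s in zip(y_true, y_score) if y == 1]
    neg = [s for y, s in zip(y_true, y_score) if y == 0]
    if not pos or not neg:
        raise ValueError("AUC needs at least one positive and one negative label")
    wins = sum(1.0 if p > n else 0.5 if p == n else 0.0 for p in pos for n in neg)
    return wins / (len(pos) * len(neg))


def recency_weights(ages_in_days, half_life_days=180.0):
    """Assumed weighting scheme: a change loses half its weight every half-life."""
    return [0.5 ** (age / half_life_days) for age in ages_in_days]


def weighted_training_sample(changes, ages_in_days, k, seed=0):
    """Draw k changes with replacement, favouring the more recent ones."""
    rng = random.Random(seed)
    return rng.choices(changes, weights=recency_weights(ages_in_days), k=k)


# Toy data (hypothetical): 1 marks a fix-inducing change.
labels = [0, 0, 1, 1]
probs = [0.1, 0.4, 0.35, 0.8]
brier = brier_score(labels, probs)
auc = roc_auc(labels, probs)
weights = recency_weights([0, 180, 360])
sample = weighted_training_sample(["c1", "c2", "c3"], [0, 180, 360], k=5)
```

A lower Brier score and a higher AUC both indicate better predictions. The sample drawn by `weighted_training_sample` could then be passed to whichever classifier is being trained.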

preprint, 2016, arXiv

Discrete Wavelet Transform-Based Prediction of Stock Index: A Study on National Stock Exchange Fifty Index

Financial time series such as stock prices and exchange rates are often non-linear and non-stationary. The use of decomposition models has been found to improve the accuracy of predictive models. The paper proposes a hybrid approach that integrates the advantages of a decomposition model (namely, the Maximal Overlap Discrete Wavelet Transform (MODWT)) and machine learning models (ANN and SVR) to predict the National Stock Exchange Fifty Index. In the first phase, the data is decomposed into a smaller number of subseries using MODWT. In the next phase, each subseries is predicted using machine learning models (i.e., ANN and SVR). The predicted subseries are then aggregated to obtain the final forecasts. In the final stage, the effectiveness of the proposed approach is evaluated using error measures and a statistical test. The proposed methods (MODWT-ANN and MODWT-SVR) are compared with the ANN and SVR models, and it was observed that the return on investment obtained from trading rules based on the predicted values of the MODWT-SVR model was higher than that of a buy-and-hold strategy.
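The decompose, predict, and aggregate phases described in the abstract can be sketched with the standard library alone. This sketch rests on several assumptions that the abstract does not confirm: it uses the Haar filter with circular boundaries, for which the MODWT coefficients at each level add back to the original series, and it substitutes a naive persistence forecaster for the ANN and SVR models. The function names and the price values are hypothetical.

```python
def haar_modwt_additive(series, levels):
    """Haar MODWT with circular boundaries.

    Returns (details, smooth) such that the details plus the smooth
    add up, point by point, to the input series.
    """
    n = len(series)
    smooth = list(series)
    details = []
    for j in range(1, levels + 1):
        shift = 2 ** (j - 1)  # the filter is dilated at each level
        prev = smooth
        # Scaling coefficients: average of a point and its lagged neighbour.
        smooth = [0.5 * (prev[t] + prev[(t - shift) % n]) for t in range(n)]
        # Wavelet coefficients: what the averaging removed at this level.
        details.append([prev[t] - smooth[t] for t in range(n)])
    return details, smooth


def persistence_forecast(subseries):
    """Stand-in forecaster (not ANN or SVR): repeat the last observed value."""
    return subseries[-1]


def hybrid_forecast(series, levels, forecaster=persistence_forecast):
    """Forecast each subseries separately, then sum the forecasts."""
    details, smooth = haar_modwt_additive(series, levels)
    return sum(forecaster(sub) for sub in details + [smooth])


# Hypothetical index values, for illustration only.
prices = [100.0, 101.5, 99.8, 102.3, 103.1, 102.7, 104.0, 105.2]
details, smooth = haar_modwt_additive(prices, levels=2)
rebuilt = [sum(d[t] for d in details) + smooth[t] for t in range(len(prices))]
next_value = hybrid_forecast(prices, levels=2)
```

Any one-step forecaster that takes a list and returns a number can be passed as `forecaster`. Note that circular boundaries let the earliest coefficients depend on values from the end of the series, so a real forecasting study would need a boundary treatment that avoids this look-ahead.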