CCSU Logo

Data Mining

 

 

 

 

 

 

 

 

 

  Graduates of the MS in Data Mining Program

We are proud of our graduates.  Their theses are available for download through the CCSU library.
Please note that 100% of our graduates are employed using their new data mining expertise.

Jeffrey Allard (2013), An Application of Gradient Boosted Decision Trees and Random Forests to Prospect Direct Marketing Response Modeling.
Thesis Committee: Darius Dziuda (Chair), Daniel Larose, and Daniel Miller. 
Jeffrey is a Senior Research Analyst / Assistant Vice President at Fifth Third Bank in Michigan.


Thomas Wilk, Jr. (2013), Improving Workplace Accident Fatality Classification Models with Text Mining and Ensemble Methods.
Thesis Committee: Daniel Larose (Chair), Roger Bilisoly, and Daniel Miller. 


George DeVarennes (2013), Applying Cost Benefit Analysis to a Trinary Classification Model.
Thesis Committee: Daniel Larose (Chair), Daniel Miller, and Darius Dziuda. 
George is a Senior Business Intelligence Analyst at Ing Corporation in Connecticut.


Daniel Aloi (2013), Using Crime Prediction Models to Aid Law Enforcement in Resource Allocation and Decision Making.
Thesis Committee: Daniel Larose (Chair), Daniel Miller, and Darius Dziuda. 
Daniel is the Owner / Operator of Mathnasium in West Hartford, Connecticut.


William E. Rowe (2013), Classifying Web Pages by Image Attributes.
Thesis Committee: Daniel Larose (Chair), Roger Bilisoly, and Krishna Saha. 
William is a Database Analyst with NRT Web Services in Minnesota.


Martin Couture (2013), Applying Data Mining Techniques in Classifying Personal Automobile Insurance Risk.
Thesis Committee: Daniel Larose (Chair), Roger Bilisoly, and Daniel Miller. 
Martin is Senior Actuary for Travelers Insurance Company in Connecticut.


Steven Cultrera (2013), Analysis of the Impact of Weather on Runs Scored in Baseball Games at Fenway Park.
Thesis Committee: Daniel Larose (Chair), Roger Bilisoly, and Daniel Miller. 
Steve is Senior Research Consultant at The Hartford Insurance Company in Connecticut.


Kay Batta (2013), Applying Misclassification Costs to Ameliorate the False Positive Rate in Bioassay Screening.
Thesis Committee: Daniel Larose (Chair), Roger Bilisoly, and Darius Dziuda. 
Kay is ETL Analyst at IMS Health Corporation in Wisconsin.

Scott W. Burk, PhD (2012), Measuring Serial Emotional Content in the Enron Email Corpus.
Thesis Committee: Roger Bilisoly (Chair), Daniel Larose, and Darius Dziuda. 
Dr. Burk is Associate Vice President and Chief Statistician at Scott & White Health Plan in Texas.

Senthil Murugan (2012), Mining for Profitable Low-Risk Delta-Neutral Long Straddle Option Strategies.
Thesis Committee: Daniel Larose (Chair), Roger Bilisoly, and Zdravko Markov.
Senthil is a Senior Scientist at Cegedim Dendrite International.

Rajiv Sambisavan (2012), Modeling of Flight Delays.
Thesis Committee: Daniel Larose (Chair), Daniel Miller, and Darius Dziuda. 
Rajiv is a CSC Contractor for the Internal Revenue Service.

Giancarlo Crocetti (2012), Topical Discovery of Web Content.
Thesis Committee: Roger Bilisoly (Chair), Daniel Miller, and Daniel Larose. 
Giancarlo is an Information Architect at Boehringer-Ingolheim Pharmaceuticals in Connecticut.


Malcolm Houtz (2012), Applying Natural Language Processing and Document Classification to Text Mining RSS Feeds in order to Classify Documents as Interesting or Not, to an Analyst at the Company, Alliant.
Thesis Committee: Roger Bilisoly (Chair), Daniel Miller, and Daniel Miller. 
Malcolm is Analytics Director at Alliant Cooperative Data Solutions.


Edwin Rivera (2012), Anti-Money Laundering Behavior: Reducing the Number of Non-Productive Alerts in Structuring through Effective Data Mining.
Thesis Committee: Daniel Larose (Chair), Daniel Miller, and Roger Bilisoly. 
Edwin is an Optimization Statistician and VP at Citi Group in Florida.


Judith Gu (2012), Using Data Mining to Model Market Reaction to Corporate Earnings Announcements.
Thesis Committee: Daniel Larose (Chair), Daniel Miller, and Darius Dziuda. 
Judith works as an analyst at Goldman, Sachs, in New York City.

Sampson Adu-Poku (2012), Comparing Classification Algorithms in Data Mining.
Thesis Committee: Daniel Larose (Chair), Roger Bilisoly and Zdravko Markov. 
Sampson is Senior IT Project Consultant at United Health Group.


Judith Spomer (2009), Latent Semantic Analysis and Classification Modeling in Applications for Social Movement Theory.
Thesis Committee: Roger Bilisoly (Chair), Daniel Larose, and Zdravko Markov. 
Judith is Manager of Software Systems Research and Development at Sandia National Laboratories in New Mexico.

Thierry Vallaud (2009), Estimating Potential Customer Value Using Classification of Customer Data.
Thesis Committee: Daniel Larose (Chair), Daniel Miller, and Zdravko Markov.
Thierry now performs data mining for Socio Logiciels in Paris, France.  In fact, Thierry completed his entire program from France.

Donald Wedding, PhD (2009), Extending the Data Mining Software Packages SAS Enterprise Miner and SPSS Clementine to Handle Fuzzy Cluster Membership: Implementation with Examples.
Thesis Committee: Roger Bilisoly (Chair), Daniel Larose, and Zdravko Markov.
Dr. Wedding is Principal Industry Consultant for Advanced Analytics at the SAS Institute.

Kathleen Alber (2007), Identifying Patterns of Potentially Preventable Emergency Department Utilization by American Children.
Thesis Committee: Daniel Larose (Chair), Zdravko Markov, and Roger Bilisoly.
Kathleen is a Predictive Data Analyst at Service National Corporation.

Steven Barbee (2007), The Discovery by Data Mining of Rogue Equipment in the Manufacture of Semiconductor Devices.
Thesis Committee: Daniel Larose (Chair), Zdravko Markov, and Darius Dziuda.
Steven retired from IBM and now works for SPSS, Inc.

Eric Taylor (2005), Comparing Unsupervised Multivariate Normal Cluster Results between Datasets and Consolidating Similar Clusters.
Thesis Committee: Daniel Larose (Chair), Daniel Miller, and Zdravko Markov.
Eric works as a statistical analyst for United Technologies Corporation.

James Steck (2005), NETPIX: A Method of Feature Selection Leading to Accurate Sentiment-Based Classification Models.
Thesis Committee: Daniel Larose (Chair), Daniel Miller, and Zdravko Markov.
James completed all the program course work from his home in Washington state.  He now works as a statistician at Nordtrom Corporation in Washington. 


Rafiqul Islam (2004), Knowledge Discovery in Microarray Data
Thesis Committee: Daniel Larose (Chair), Zdravko Markov, and Roger Bilisoly.
Rafiq now works for a genomics analysis company in New Jersey.






Pictured here celebrating our first graduate Rafiqul Islam in 2004 are, left to right, Dr. Roger Bilisoly, thesis committee member, Dr. Daniel Larose, thesis committee chair, Dr. Timothy Craine, Chair of the Department of Mathematical Sciences, Dr. Susan Pease, Dean of the School of Arts and Sciences, Rafiqul Islam, Dr. Estela Lopez, Vice Chancellor of the Connecticut State University System, and Dr. Zdravko Markov, thesis committee member.