CSE 7331 Fall 2007 FINAL
PREDICTION BIODEGRADATION
The study of biodegradation of compounds in nature is an important research area in Environmental Engineering. However, the accurate prediction of which compounds actually biodegrade and the speed with which they degrade is a difficult problem yet to be solved. Previous prediction algorithms tend to rely on structural properties of compounds to do the prediction and create somewhat simplistic linear regression models. (Part of the problem with previous prediction algorithms is the lack of large amounts of reliable data. Unfortunately this will also be a problem with this project.)
Your project requires the development and comparison of data mining classification algorithms to predict biodegradability of compounds. You are provided a small dataset from which machine learning can take place. (If possible, a separate validation dataset will be provided.)
Assignment
Sample Projects