Text Message Categorization of Collaborative Learning Skills in Online Discussion using Neural Network
This paper presents research in neural network approach for text messages categorization of collaborative learning skill in an online discussion. Although a neural network is a popular method for text categorization in the research area of machine learning, unfortunately, the use of neural network in educational settings is rare. Usually, text categorization by neural network is employed to categorize news articles, emails, product reviews, and web pages. In an online discussion, text categorization that is used to classify the message sent by the student into a certain category is often manual, requiring skilled human specialists. However, human categorization is not an effective way for a number of reasons; time- consuming, labor-intensive, lack of consistency in a category, and costly. Therefore, this paper proposes a neural network approach to code the message automatically. Results show that neural networks achieving useful classification on eight categories of collaborative learning skills in an online discussion as measured based on precision, recall, and balanced F-measure.
 A.K.F. Lui, S.C. Li and S.O. Choy, “An evaluation of automatic text categorization in online discussion analysis,” Seventh IEEE International Conference on Advanced Learning Technologies, Niigata, Japan, 205-209. 2007
 W. Law and K.F. Low (1997). Automatic document classification based on probabilistic reasoning: Model and performance analysis. Proceedings of the IEEE International Conference on Systems, Man and Cybernetics, Vol.3, 2719-2723.
 Soucy, P & Mineau, G.W. (2001). A simple k-NN program for text categorization. The first IEEE International Conference on Data Mining (ICDM_01), Vol. 28, 647-648.
 Ma, L, Shepherd, J & Zhang, Y (2003). Enhancing text clasification using synopses extraction. The fourth International Conference on Web Information System Engineering. 115-124.
 Farkas, J. (1994). Generating document clusters using thesauri and neural networks. Canadian conference on Electrical and ComputerEngineering, Vol.2, 710- 713.
 Joachims, T. (1998). Text categorization with support vector machines: Learning with many relevant features. 10th European Conference on Machine Learning, 137-142.
 Benkhalifa, M., Bensaid, A. & Mouradi, A. (1999). Text categorization using the semi-supervised fuzzy c- means algorithm. 18th International conference of the north american fuzzy information processing society-NAFIPS, 561-568.
 Kazama. J & Tsujii, J. (2005). Maximum entropy models with inequality constrains: A case study on text categorization. Machine Learning. 60(1-3), 159- 194.
 Gabrilovich, E. & Markovitch, S. (2004). Text categorization with many redundant features: Using aggressive feature selection to make SVMs competitive with C4.5. Proceedings of the 21st International conference on machine learning, 321- 328.
 Yu, B., ben Xu, Z., hua Li, C. (2008). Latent semantic analysis for text categorization using neural network. Journal of Knowledge-Based Systems, 21, 900-904.
 Ng, H.T., Goh, W.B. & Low, K.L. (1997). Feature selection, perception learning, and usability case study for text categorization. 20th Annual international ACM-SIGIR Conference on research and development in information retrieval, 67-73.
 Savio L. Y. Lam & D. K. Lee (1999). Feature reduction for neural network based text categorization. Sixth International Conference on Database Systems for Advance Applications(DASFAA), 195-203.
 Gokhale, A.A. (1995). Collaborative learning enhance critical thinking. Journal of Technology Education, 7(1).
 Soller, A. (2001). Supporting social interaction in an intelligent collaborative learning system. Journal of Artificial Intelligence in Education, 12.
 McManus, M, & Aiken, R. (1995). Monitoring computer-based problem solving. Journal of Artificial Intelligence in Education, 6(4), 307-336.
 Hidalgo, J.M.G. (2003). Text representation for automatic text categorization, Eleventh conference of the european chapter of the association for computational linguisitics EACL 2003.
 Porter, M.F. (1997). An algorithm for suffix stripping. In Morgan Kaufmann, Readings in information retrieval, (pp. 313-316). Morgan Kaufmann Publishers Inc.
 Sebastiani, F. (2002). Machine learning in automated text categorization. ACM Computing Survey, 34(1), 1- 47.
 Yang, Y. & Pedersen, J. O. (1997). A comparative study on feature selection in text categorization. Proceedings of ICML-97, 14th International conference on machine learning, 412-420.
 De Wever, B., Schellens, T., Valcke, M. & Van Keer, H. (2005). Content analysis schemes to analyze transcripts of online asynchronous discussion groups: A review, Computers & Education 46(1), pp. 6–28.
 Krippendorff (2004). Quantitative content analysis: An introduction to its method, Beverly Hills, Sage Publications.
 Salton, G., Wong, A. & Yang, C.S. (1975). A vector- space model for automatic indexing. In Communications of the ACM, Vol.18, Issue 11 (pp. 613–620). ACM Press.