Data Imbalance Problem in Text Classification