COUNTVECTORIZER YORDAMIDA SO‘ZLAR STATISTIKASINI ANIQLASH

Authors

  • Alayev Ruhillo Author
  • Maxmudjonova Gulshaxnoz Author

Keywords:

word statistics, text processing, frequency, text, tokenization

Abstract

This article provides an overview of CountVectorizer, an important tool in natural language processing and effective machine learning for text. Explains the CountVectorizer methodology and retrieves word frequencies and a document term matrix. CountVectorizer helps with word statistics and text analysis.

References

Achal J, Harshada J, Bavik J, Charmi Ch “Text Pre-Processing Techniques in Natural Language Processing: A Review” International Research Journal of Engineering and Technology (IRJET), 2022. –B. 878

Alayev R, Maxmudjonova G, “O‘zbek tilidagi matnli hujjatlarda izlashni amalga oshirishni takomillashtirish”, Toshkent: O‘zbekistan: til va madaniyat 2023. –B. 79,

Elov B, Hamroyeva Sh, Xusainova Z, Xudayberganov N (2023). “O‘zbek tili korpusi matnlarini qayta ishlashda CountVectorizer, TF-IDF hamda Co-occurrence matrix usullarining ahamiyati” Elektron lug’atlar yaratishning nazariy va amaliy asoslari mavzusidagi xalqaro ilmiy-amaliy anjuman materiallari ., Andijon-2023. – B. 81

Elov B., Hamroyeva Sh., Alaev R., Xusainova Z., Yodgorov U., “O‘zbek tili korpusi matnlarini qayta ishlash usullari” Raqamli Transformatsiya va Sun’iy Intellekt ilmiy jurnali, 2023. –B. 117-129.

Maxmudjonova G., “Nomuhim so‘zlar tushunchasi va uning ahamiyati”. Kompyuter lingvistikasi: muammolar, yechim, istiqbollar Xalqaro ilmiy-amaliy konferensiya materiallari, 2023. 204-211.

Xusainova, Z., Elov, B., Yodgorov, U., O‘zbek tili matnlari uchun tokenizayorni ishlab chiqish.., MUHAMMAD AL-XORAZMIY AVLODLARI ilmiy-amaliy va axborat- tahliliy jurnal, 2023. –B. 27

Downloads

Published

2024-06-24

Issue

Section

SECTION 4. Linguistic database and software of machine translation.

How to Cite

COUNTVECTORIZER YORDAMIDA SO‘ZLAR STATISTIKASINI ANIQLASH. (2024). «CONTEMPORARY TECHNOLOGIES OF COMPUTATIONAL LINGUISTICS», 2(22.04), 366-370. https://myscience.uz/index.php/linguistics/article/view/80

Similar Articles

11-20 of 60

You may also start an advanced similarity search for this article.