-
آرشیو :
نسخه زمستان 1401
-
کد پذیرش :
1414
-
موضوع :
سایر شاخه ها
-
نویسنده/گان :
| احسان باقری، مهدی حسنی نسب، ایوب محمدیان
-
زبان :
فارسی
-
نوع مقاله :
پژوهشی
-
چکیده مقاله به فارسی :
خلاصه سازی خودکار متن، یک پژوهش ضروری در پردازش زبان طبیعی است که تلاش میکند اسناد متنی را خلاصه کند تا کاربران بتوانند به سرعت به اطلاعات مفید دسترسی پیدا کنند. با وجود اینکه در زبان فارسی تلاشهایی برای ایجاد خلاصهسازی متون صورت گرفته است، اغلب موارد این پژوهشها به فرم نظریه ارائه شده است. در این پژوهش سعی میشود یک سیستم بر اساس رتبه متن و گراف تحلیلگر نحوی جملات عمل کند و سپس نتایج برای پردازش به الگوریتم معروف رتبه صفحه هدایت شود که جملات با بالاترین رتبه گراف را استخراج کند. سپس به همین روش، کلمات استخراج میشود و نهایتا با استفاده از الگوریتمهای فرااکتشافی، برخی از جملات دارای کلمات کلیدی به متن استخراج شده افزوده میشوند. لازم به ذکر است اگرچه الگوریتم پیشنهادی، از تحلیلصرف و نحوی برای شباهت بین جملات استفاده میکند، با اینحال این کار در زبان فارسی میتواند توسط الگوریتمهای مختلفی صورت پذیرد. با توجه به اینکه تاکنون سیستم مذکور پیشنهاد نشده است و همچنین در خصوص مقایسه تحلیل نحوی جملات در زبان فارسی نیز تحقیق جامعی انجام نگرفته است، این موضوعات جدید محسوب میشود. نتایج برنامه نه تنها به با جامعه آماری وسیع تری نسبت به موارد مشابه مقایسه گردیده است بلکه از نظر متون تخصصی و استفاده از سخت افزار معمول هم مورد توجه قرار گرفته است و در همه حالات توانسته نتایج عملی قابل قبولی را کسب نماید.
-
لیست منابع :
[1] M. V. V. V. M. S. I. Ashmarina, Digital age: chances, challenges and future, 2020.
[2] M. Rosemann, "Structuring in the Digital Age," The Art of Structuring, 2019.
[3] H. Dalianis, "A Text Summarizer for Swedish," NADA, KTH, Stockholm, 2000.
[4] M. Hassel and N. Mazdak, "FarsiSum - A Persian Text Summarizer," Proceedings of the Workshop on Computational Approaches to Arabic Script-based Languages, p. 82–84, 2004.
[5] Azadeh Zamanifar, Behrouz Minaei and Mohsen Sharifi, "A New Hybrid Farsi Text Summarization Technique Based on Term Co-Occurrence and Conceptual Property of the Text," Software Engineering, Artificial Intelligence, Networking, and Parallel/Distributed Computing, 2008.
[6] T. A. a. M. E. J. Mehrnoush Shamsfard, "Persian Document Summarization by Parsumist," World Applied Sciences Journal 7 (Special Issue of Computer & IT): 1, 2009.
[7] A. Zamanifar and O. Kashefi, "AZOM: A Persian Structured Text Summarizer," Natural Language Processing and Information Systems, p. 234–237, 2011.
[8] Fatemeh Shafiee and Mehrnoush Shamsfard, "Similarity versus relatedness: A novel approach in extractive Persian document summarisation," Journal of Information Science, 2017.
[9] Tayyebeh Hosseinikhah, Abbas Ahmadi and Azadeh Mohebi, "A new Persian Text Summarization Approach based on Natural Language Processing and Graph Similarity," Iranian Journal of Information Processing and Management, 2018.
[10] F. Kiyoumarsi and F. Esfahani, "Optimizing Persian Text Summarization Based on Fuzzy Logic Approach," International Conference on Intelligent Building and Management, 2011.
[11] M. Tofighy, O. Kashefi and H. Javadi, "Persian Text Summarization Using Fractal Theory," in Communications in Computer and Information Science ·, 2011.
[12] M. Bazghandi, G. Tadayon, T. Jahan and M. Vafaei, "Extractive Summarization of Farsi Documents Based on PSO Clustering," IJCSI International Journal of Computer Science, 2012.
[13] Seyyed Mohsen Tofighy, Ram Gopal Raj and Hamid Haj Seyyed Javad, "AHP Techniques for Persian Text Summarization," Malaysian Journal of Computer Science, 2013.
[14] Asef Pour masoomi, Mohsen Kahani, Seyyed Ahmad Toosi and Ahmad Estiri, "Ijaz: An Operational system for single-document summarization of Persian news texts," Signal and Data Processing, 2014.
[15] T. Strutz, Data Fitting and Uncertainty A practical introduction to weighted least squares and beyond, Leipzig, Germany, 2010.
[16] B. B. Moghaddas, M. Kahani, S. A. Toosi, AsefPourmasoumi and A. Estiri, "Pasokh: A Standard Corpus for the Evaluation of Persian Text Summarizers," 3rd International Conference on Computer and Knowledge Engineering, 2013.
[17] Saeed Farzi and Sahar Kianian, "Katibeh: A Persian news summarizer using the novel semi-supervised approach," Digital Scholarship in the Humanities, 2018.
[18] Mohammad Fakhredanesh, Mohammad Ebrahim Khademi and Seyed Mojtaba Hoseini, "Farsi Conceptual Text Summarizer: A New Model in Continuous Vector Space".
[19] H. M. H. I. A. A. M. Al-Zahrani, "PSO-Based Feature Selection for Arabic Text Summarization," Computer Science, 2015.
[20] Mohammed Binwahlan, Naomie Salim and Ladda Suanmali, "Swarm Based Text Summarization," International Association of Computer Science and Information Technology, 2009.
[21] S. Miles, L. Yao, W. Meng, C. M. Black and Z. B. Miled, "Topic Extraction from A Cancer Health Forum," IEEE 9th International Conference on Healthcare Informatics (ICHI), 2021.
[22] Samuel Miles, Lixia Yao, Weilin Meng, Christopher M. Black and Zina Ben Miled, "Comparing PSO-based clustering over contextual vector embeddings to modern topic modeling," Information Processing & Management, vol. 59, no. 3, 2022.
[23] Shrabanti Mandal, Girish Kumar Singh and Anita Pal, "Single document text summarization technique using optimal combination of cuckoo search algorithm, sentence scoring and sentiment score," International Journal of Information Technology, vol. 13, p. 1805–1813, 2021.
[24] R. Z. Al-Abdallah and A. T. Al-Taani, "Arabic Text Summarization using Firefly Algorithm," IEEE, 2019.
[25] Kaleab Getaneh Tefrie and Kyung-Ah Sohn, "Autonomous Text Summarization Using Collective Intelligence Based on Nature-Inspired Algorithm," International Conference on Mobile and Wireless Technology, p. 455–464, 2017.
[26] Jesus M. Sanchez-Gomez, Miguel A. Vega-Rodríguez and Carlos J. Pérez, "Extractive Multi-Document Text Summarization Using a Multi-Objective Artificial Bee Colony Optimization Approach," Knowledge-Based Systems, vol. 159, 2017.
[27] Richa Sharma, Sudha Morwal and Basant Agarwal, "Named entity recognition using neural language model and CRF for Hindi language," Computer Speech & Language, vol. 74, 2020.
[28] Ahmad Alhasan and Ahmad T. Al-Taani, "POS Tagging for Arabic Text Using Bee Colony Algorithm," Procedia Computer Science, vol. 142, 2018.
[29] Ling Zhao, Ailian Zhang, Ying Liu and Hao Fei, "Encoding multi-granularity structural information for joint Chinese word segmentation and POS tagging," Pattern Recognition Letters, vol. 138, pp. 163-169, 2020.
[30] Roger C. Schank, "Conceptual dependency: A theory of natural language understanding," Cognitive Psychology, pp. 552-631, 1972.
[31] E. &. A. F. T. W. Hunt, "The Whorfian hypothesis: A cognitive psychology perspective.," Psychological Review, p. 377–389, 1991.
[32] Aubrey L. Gilbert, Terry Regier, Paul Kay and Richard B. Ivry, "Whorf hypothesis is supported in the right visual field but not the left," Biological Sciences, pp. 489-494, 2005.
[33] Rochel Gelman and C. R. Gallistel, "Language and the Origin of Numerical Concepts," Science, pp. 441-443, 2004.
[34] Rafael E. Núñez and Eve Sweetser, "With the Future Behind Them: Convergent Evidence From Aymara Language and Gesture in the Crosslinguistic Comparison of Spatial Construals of Time," Cognitive Science Society, Inc, 2005.
[35] Jennie E. Pyers and Ann Senghas, "Language Promotes False-Belief Understanding: Evidence From Learners of a New Sign Language," Psychological Science, 2009.
[36] Simpson-Finch H, Yohe Moore ES, Brandt B, Poepsel T, Heinzman A and Dempsey S, "PMU61 - Linguistic and Cultural Considerations When Implementing A Global ‘Bring your Own Device’ (BYOD) Study," Value in Health, vol. 21, pp. 590-591, 2018.
[37] Ángel Hernández-Castañeda, René Arnulfo García-Hernández, Yulia Ledeneva and Christian Eduardo Millán-Hernández, "Language-independent extractive automatic text summarization based on automatic keyword extraction," Computer Speech & Language, vol. 71, 2022.
[38] Abdhul Ahadh, Govind Vallabhasseri Binish and Rajagopalan Srinivasan, "Text mining of accident reports using semi-supervised keyword extraction and topic modeling," Process Safety and Environmental Protection, vol. 155, pp. 455-465, 2021.
-
کلمات کلیدی به فارسی :
خلاصه سازی تک سندی، پردازش زبان، تحلیل صرف ونحوی، الگوریتم رتبه صفحه.
-
چکیده مقاله به انگلیسی :
-
کلمات کلیدی به انگلیسی :
- صفحات : 33-47
-
دانلود فایل
( 850.19 KB )