- About 47,400 characters
- About 50 pages
- Published 2019-02-22 in Shanghai
Harbin Institute of Technology Master of Engineering Thesis
Abstract
In recent years, with the rapid development of Web 2.0, the Internet has grown into an information carrier holding a huge amount of data and rich content. New forms of knowledge services with strong user interaction have emerged, such as online encyclopedias, personal blogs, and forums. Among these online services, forums allow users to raise and discuss issues, share information, and post freely and simply, so forums are highly timely and are accepted by the majority of users. How to make full use of the data in financial forums, and how to organize and mine useful information from this massive data so that users can access it, is the main subject of this paper. The paper covers two aspects:
First, a vertical search engine for forums is built. Following the workflow of a general search engine, the system implements the spider module, the web data extraction and indexing modules, and the query sorting module in turn. Because the engine targets financial forums, each part has its own characteristics. For example, in the spider module, the crawling strategy crawls the daily hot stocks more frequently to improve the timeliness of the whole system. In the query sorting module, the system provides not only relevance sorting, as a general search engine does, but also sorting by number of hits, number of replies, and timeliness of the post.
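The two customizations described above can be sketched in a few lines of Python. This is only an illustration under stated assumptions, not the thesis implementation: the `Post` fields, the `SORT_KEYS` table, the `sort_posts` and `crawl_interval` functions, and the interval values (a 6-hour base, 6 times shorter for hot stocks) are all hypothetical names and numbers chosen for the example.

```python
from dataclasses import dataclass
from datetime import datetime, timedelta


@dataclass
class Post:
    title: str
    relevance: float      # hypothetical relevance score returned by the index
    hits: int             # number of views
    replies: int          # number of replies
    posted_at: datetime   # publication time, used for timeliness sorting


# One key function per sorting criterion named in the abstract.
SORT_KEYS = {
    "relevance": lambda p: p.relevance,
    "hits": lambda p: p.hits,
    "replies": lambda p: p.replies,
    "time": lambda p: p.posted_at,
}


def sort_posts(posts, by="relevance"):
    """Return the posts in descending order of the chosen criterion."""
    return sorted(posts, key=SORT_KEYS[by], reverse=True)


def crawl_interval(is_hot_stock, base=timedelta(hours=6), hot_factor=6):
    """Boards for hot stocks get a shorter revisit interval (values are illustrative)."""
    return base / hot_factor if is_hot_stock else base
```

Calling `sort_posts(results, by="replies")` would return the most-discussed posts first, while `crawl_interval(True)` gives the shorter revisit interval a scheduler might apply to a board that covers one of the day's hot stocks.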
Second, the paper turns to data mining on financial forums to provide a more user-friendly and intelligent service. The main work is text classification of forum posts. After word segmentation and text feature extraction, a naïve Bayesian algorithm is used to classify the data. Next, an improved Bayesian algorithm based on an emotional dictionary is proposed. The paper uses knowledge from the HowNet database to effectively improve the performance and accuracy of classificat
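As a rough illustration of the classification step, the sketch below implements a multinomial naïve Bayes classifier with Laplace smoothing over already-segmented tokens. The abstract does not say how the emotional dictionary enters the improved algorithm, so the `lexicon` and `lexicon_weight` parameters are an assumption made for this example only: tokens found in the dictionary simply count more heavily. The class and method names are likewise hypothetical.

```python
import math
from collections import Counter, defaultdict


class NaiveBayes:
    """Multinomial naive Bayes with Laplace smoothing over pre-segmented tokens."""

    def __init__(self, lexicon=None, lexicon_weight=1.0):
        # Assumption: dictionary words are up-weighted; the thesis may do this differently.
        self.lexicon = set(lexicon or [])
        self.lexicon_weight = lexicon_weight
        self.class_docs = Counter()              # documents seen per class
        self.word_counts = defaultdict(Counter)  # weighted token counts per class
        self.vocab = set()

    def _weight(self, token):
        return self.lexicon_weight if token in self.lexicon else 1.0

    def fit(self, docs, labels):
        for tokens, label in zip(docs, labels):
            self.class_docs[label] += 1
            for token in tokens:
                self.word_counts[label][token] += self._weight(token)
                self.vocab.add(token)
        return self

    def predict(self, tokens):
        total_docs = sum(self.class_docs.values())
        best_label, best_score = None, -math.inf
        for label, n_docs in self.class_docs.items():
            counts = self.word_counts[label]
            denom = sum(counts.values()) + len(self.vocab)  # Laplace smoothing
            score = math.log(n_docs / total_docs)           # log prior
            for token in tokens:
                if token not in self.vocab:
                    continue  # ignore tokens never seen in training
                score += self._weight(token) * math.log((counts[token] + 1) / denom)
            if score > best_score:
                best_label, best_score = label, score
        return best_label
```

With `lexicon_weight=1.0` the class reduces to plain naïve Bayes, which makes it easy to compare the baseline against the dictionary-weighted variant on the same data.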