With its rapid development, the Internet has become an important way to access news and information. How to access relevant information more conveniently, more comprehensively and more accurately (original wording changed) has become an issue. As traditional network media (note: this is a plural noun) with dedicated websites (is “dedicated” better than “single”?) no longer satisfy the needs of users, the notion of the news search engine has emerged. With the growing popularity of mobile phones and the continual improvement of their convenience (“convenience” is unclear; perhaps “features” or “usability”?), mobile news search is becoming (not “has become”: it has presumably not caught on yet and is still developing) a trend.
In this paper, a number of key technologies of the mobile Chinese news search engine (should this be “Chinese mobile search engines on news” or “mobile search engines on Chinese news”?) have been analyzed in depth (the misspelling “analysised” was corrected) and researched, and a prototype system has been conceptualized (“realized” is somewhat too assertive; alternatively use “has been schemed”, “will be presented” or “will be introduced”). The study includes the following main points:
1) Design and implementation of a text extraction algorithm for news HTML pages based on the characteristics of human vision. The algorithm is based on the judgment of text (“as people” deleted; its meaning was unclear). According to (“the” removed) factors including the number (would “count” be better?) of Chinese characters, the number of hot words (would “frequency” be better?) and the number of hyperlinks (could “the number of” be dropped?), (“a” deleted) certain paragraphs (use “patterns” or “paragraph patterns”) of text can be determined. (What follows should be a new sentence; its meaning was unclear.) Then, by using the relationship of HTML nodes (would “relationship governing HTML nodes” be better?), the text of the news can be abstracted (or extracted). Experiments indicate that with this method the text of news pages can be accurately extracted, and other irrelevant (the misspelling “unrelevant” was corrected; would “irrelevant and redundant sections” be better?) parts such as advertisements (rephrased) can be well removed (should this be “can be removed as well”?). Unlike (this can be used to join the sentences) traditional extraction methods (plural “methods”?), it requires no pre-learning and no adjustment of the configuration for different websites and different channels. (Wording changed.)
2) Design of mobile Chinese news search engine system, the concrete realization of the program, achieving a system prototype and made a number of improvements to the users’ experience as the next phase of work. (Lacks parallelism.)
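“made a number of improvements to the users’ experience as the next phase of work.” The meaning of this sentence is unclear. Has the next phase of work already taken place, or not yet? If it has not, use “suggested ... for the next phase of work”. “Next” signals the following step, that is, work that comes afterwards.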
Design of a mobile Chinese news search engine system, the actual realization of this program, and a prototype with a number of improvements made according to users’ experience in the second work phase.
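
To make the extraction idea in point 1) more concrete, the following is a minimal sketch, not the algorithm of the paper itself. It scores each block of an HTML page by the three factors the abstract names (count of Chinese characters, count of hot words, count of hyperlinks) and keeps the blocks that look like body text. The names `BlockCollector`, `score` and `extract_main_text`, the weights 5 and 10, and the threshold `min_score` are all hypothetical choices made for illustration. The step that uses the relationship between HTML nodes is not modeled here.

```python
import re
from html.parser import HTMLParser

# Matches characters in the CJK Unified Ideographs block.
CJK = re.compile(r"[\u4e00-\u9fff]")
# Tags treated as block boundaries (an assumption for this sketch).
BLOCK_TAGS = {"p", "div", "td", "li"}


class BlockCollector(HTMLParser):
    """Collect the text and the hyperlink count of each block."""

    def __init__(self):
        super().__init__()
        self.blocks = [{"text": "", "links": 0}]
        self._skip = 0  # depth inside <script> or <style>

    def _new_block(self):
        self.blocks.append({"text": "", "links": 0})

    def handle_starttag(self, tag, attrs):
        if tag in ("script", "style"):
            self._skip += 1
        elif tag in BLOCK_TAGS:
            self._new_block()
        elif tag == "a":
            self.blocks[-1]["links"] += 1

    def handle_endtag(self, tag):
        if tag in ("script", "style") and self._skip:
            self._skip -= 1
        elif tag in BLOCK_TAGS:
            self._new_block()

    def handle_data(self, data):
        if not self._skip:
            self.blocks[-1]["text"] += data.strip()


def score(block, hot_words):
    """Reward Chinese characters and hot words, penalize hyperlinks.

    The weights are illustrative assumptions, not values from the paper.
    """
    text = block["text"]
    chinese_chars = len(CJK.findall(text))
    hot_hits = sum(text.count(word) for word in hot_words)
    return chinese_chars + 5 * hot_hits - 10 * block["links"]


def extract_main_text(html, hot_words=(), min_score=20):
    """Return the text of all blocks whose score reaches min_score."""
    parser = BlockCollector()
    parser.feed(html)
    parser.close()
    kept = [
        b["text"]
        for b in parser.blocks
        if b["text"] and score(b, hot_words) >= min_score
    ]
    return "\n".join(kept)
```

In this sketch a navigation bar made of short linked labels scores low because every hyperlink is penalized, while a long paragraph of Chinese text with few links scores high. No per-website configuration or training is involved, which matches the property the abstract claims, although a real system would need to tune the weights and the threshold.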