2013.8U-Air-When Urban Air Quality Inference Meets Big Data
U-Air: When Urban Air Quality Inference Meets Big Data
Yu Zheng, Furui Liu, Hsun-Ping Hsieh
Microsoft Research Asia, Beijing, China
{yuzheng, v-ful, v-hshsie}@
ABSTRACT
Information about urban air quality, e.g., the concentration of
PM2.5, is of great importance for protecting human health and
controlling air pollution. A city has only a limited number of air
quality monitoring stations, yet air quality varies non-linearly
across urban space and depends on multiple factors, such as
meteorology, traffic volume, and land use. In this paper, we infer
real-time, fine-grained air quality information throughout a city,
based on the (historical and real-time) air quality data reported by
existing monitoring stations and a variety of data sources we observed
in the city, such as meteorology, traffic flow, human mobility, the
structure of road networks, and points of interest (POIs). We propose a
semi-supervised learning approach based on a co-training
framework that consists of two separate classifiers. One is a
spatial classifier based on an artificial neural network (ANN),
which takes spatially-related features (e.g., the density of POIs
and length of highways) as input to model the spatial correlation
between air qualities of different locations. The other is a
temporal classifier based on a linear-chain conditional random
field (CRF), involving temporally-related features (e.g., traffic
and meteorology) to model the temporal dependency of air quality
in a location. We evaluated our approach with extensive
experiments based on five real data sources obtained in Beijing
and Shanghai. The results show the advantages of our method
over four categories of baselines, including linear/Gaussian
interpolations, classical dispersion models, well-known
classification models such as decision trees and CRFs, and ANN.
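The co-training idea described in the abstract can be illustrated with a minimal sketch. It is not the paper's implementation: the ANN and the linear-chain CRF are both replaced here by a toy nearest-centroid classifier, and every function name, feature value, and label below is a hypothetical choice made for illustration. The sketch only shows the general loop, in which two classifiers trained on different feature views (spatial and temporal) take turns labeling the unlabeled locations they are most confident about.

```python
import math

def fit_centroids(features, labels):
    """Train a toy nearest-centroid classifier: one mean vector per class."""
    sums, counts = {}, {}
    for x, label in zip(features, labels):
        acc = sums.setdefault(label, [0.0] * len(x))
        for k, value in enumerate(x):
            acc[k] += value
        counts[label] = counts.get(label, 0) + 1
    return {label: [s / counts[label] for s in acc] for label, acc in sums.items()}

def most_confident(centroids, x):
    """Return (label, probability) from a softmax over negative distances."""
    scores = {label: math.exp(-math.dist(x, c)) for label, c in centroids.items()}
    total = sum(scores.values())
    label = max(scores, key=scores.get)
    return label, scores[label] / total

def co_train(spatial, temporal, labels, rounds=10):
    """Fill in labels that are None by co-training on two feature views."""
    labels = list(labels)
    for _ in range(rounds):
        unlabeled = [i for i, y in enumerate(labels) if y is None]
        if not unlabeled:
            break
        known = [i for i, y in enumerate(labels) if y is not None]
        picks = {}  # index -> (label, confidence) chosen in this round
        for view in (spatial, temporal):
            # Each classifier sees only its own view of the labeled data.
            model = fit_centroids([view[i] for i in known],
                                  [labels[i] for i in known])
            # It then nominates the unlabeled example it is surest about.
            best = max(
                ((i,) + most_confident(model, view[i]) for i in unlabeled),
                key=lambda item: item[2],
            )
            idx, label, conf = best
            if idx not in picks or conf > picks[idx][1]:
                picks[idx] = (label, conf)
        # Nominated examples join the labeled pool used by both classifiers.
        for idx, (label, _) in picks.items():
            labels[idx] = label
    return labels

# Hypothetical data: six locations, two of which have a monitoring station.
spatial = [[0.0], [0.2], [0.1], [5.0], [5.2], [4.9]]    # e.g. scaled POI density
temporal = [[1.0], [1.1], [0.9], [9.0], [9.2], [8.8]]   # e.g. scaled traffic level
labels = ["good", None, None, "unhealthy", None, None]

result = co_train(spatial, temporal, labels)
print(result)
```

Keeping the two views separate is the point of the design: each classifier's confident predictions act as extra training data for the other, which is how a small number of labeled stations can be stretched across many unlabeled locations.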
Categories and Subject Descriptors
H.2.8 [Database Management]: Database Applications - data
mining, Spatial databases and GIS;
General Terms
Algorithms, M