Skip to content

GitLab

  • Projects
  • Groups
  • Snippets
  • Help
    • Loading...
  • Help
    • Help
    • Support
    • Community forum
    • Submit feedback
    • Contribute to GitLab
  • Sign in / Register
K
kb
  • Project overview
    • Project overview
    • Details
    • Activity
    • Releases
  • Repository
    • Repository
    • Files
    • Commits
    • Branches
    • Tags
    • Contributors
    • Graph
    • Compare
  • Issues 2
    • Issues 2
    • List
    • Boards
    • Labels
    • Service Desk
    • Milestones
  • Merge requests 0
    • Merge requests 0
  • Operations
    • Operations
    • Incidents
  • Analytics
    • Analytics
    • Repository
    • Value Stream
  • Wiki
    • Wiki
  • Members
    • Members
  • Activity
  • Graph
  • Create a new issue
  • Commits
  • Issue Boards
Collapse sidebar
  • granite
  • kb
  • Wiki
    • Data_stream
  • environmental_protection_grade

environmental_protection_grade · Changes

Page history
update: 环保等级: 结果字段解析更新 authored Oct 29, 2021 by 蒋家升's avatar 蒋家升
Hide whitespace changes
Inline Side-by-side
Showing with 130 additions and 6 deletions
+130 -6
  • data_stream/environmental_protection_grade.md data_stream/environmental_protection_grade.md +130 -6
  • No files found.
data_stream/environmental_protection_grade.md
View page @ ce1b57a7
......@@ -137,8 +137,8 @@ list_of_red: 红黑榜
## 实际爬虫结果的数据结构
<!--可能与超级数据一致,可能不同的data_type的爬虫结果结构不同,超级数据是把所有data_type的结果组合在一起-->
#### 江苏:
```json
江苏:
{
"data":
[
......@@ -197,8 +197,9 @@ list_of_red: 红黑榜
"spider_name": "environmental_protection",
"spider_ip": "10.8.1.18"
}
浙江:
```
#### 浙江:
```json
{
"data":
[
......@@ -256,8 +257,9 @@ list_of_red: 红黑榜
"spider_name": "environmental_protection",
"spider_ip": "10.8.1.18"
}
福建:
```
#### 福建:
```JSON
{
"data":
[
......@@ -306,8 +308,10 @@ list_of_red: 红黑榜
"spider_name": "environmental_protection",
"spider_ip": "10.8.6.51"
}
```
四川:
#### 四川:
```json
{
"data":
[
......@@ -443,6 +447,126 @@ list_of_red: 红黑榜
"spider_ip": "10.8.1.18"
}
```
> [四川字段解析](data_stream/environmental_protection_related/sichuan_field.json)
#### 湖南:
```json
{
"data":
[
{
"SSQX": "雨花区", # 区县
"GSDJ": "环保合格单位", # 信用等级(与年度字段的年份相关)
"ND": "2020", # 年度
"QYMC": "长沙博大环保科技有限公司", # 企事业单位名称
"SSDS": "长沙市", # 市州
"GXSJ": "2021年09月27日", # 更新时间
"TYSHXYDM": "91430111344823182Y", # 统一社会信用代码
"CPDJ": 2, # 参评等级
"ZXDJ": "环保合格单位" # 当前信用等级
},
{
"SSQX": "保靖县",
"GSDJ": "环保合格单位",
"ND": "2020",
"QYMC": "保靖县人民医院",
"SSDS": "湘西土家族苗族自治州",
"GXSJ": "2021年09月27日",
"TYSHXYDM": "12433125448636058Q",
"CPDJ": 2,
"ZXDJ": "环保合格单位"
}
],
"http_code": 200,
"error_msg": "",
"task_result": 1000,
"data_type": "list",
"spider_start_time": "2021-10-29 16:03:08.162",
"spider_end_time": "2021-10-29 16:03:10",
"task_params": {"province": "hunan","step": "start","index": 1},
"metadata": {"province": "hunan","index": 1},
"spider_name": "environmental_protection",
"spider_ip": "10.8.1.10"
}
```
#### 河南:
```json
{
"data":
[
{
"belongsBasin": "1",
"companyAddress": "郑州市中原区桐柏南路158号",
"companyLevel": "2",
"companyName": "河南(郑州)中汇心血管病医院", # 企业事业单位名称
"contactAddress": "郑州市中原区桐柏南路158号",
"contactNumber": "",
"contactPerson": "",
"contactTelphone": "15290405687",
"contactUser": "周毅鹏",
"contactWechat": "zyp15290405687",
"controlLimit": "",
"createTime": "2021-06-09 14:16:43",
"createUser": "d857c4afdcc047f1a1fa70417df5f5f7",
"emissionLimits": "",
"emissionsTo": "",
"enabledStatus": "1",
"evaluateDate": "2021-09-24", # 评级时间
"evaluation": "1",
"exhaustType": "01,02,04",
"finalResult": "警示", # 等级
"hasInit": false,
"id": "0351cdb38249462f8589d065f8a207af",
"industry": "",
"industryInvolved": "Q8415",
"isUsed": "1",
"legalRepresentative": "毛慧娟", # 法人
"officePhone": "",
"orgCode": "ceaa1f4652ae4d73a2bf82d36eb2e325",
"orgName": "中原区生态环境局", # 评级单位
"organizationCode": "52410100MJF72040XB",
"outletName": "",
"pKName": "id",
"parentCode": "410100",
"pollutantName": "",
"postcode": "450000",
"pregion": "郑州市", # 城市
"productionDate": "2009-11-05",
"region": "中原区", # 区县
"regionCode": "410102",
"regionName": "",
"registeredAddress": "",
"remarks": "", # 备注
"scores": "75.0",
"unicode": "52410100MJF72040XB", # 统一社会信用代码
"updateTime": "2021-07-08 15:57:38",
"updateUser": "cd83c18f52a847bdb3466dacebc80515"
}
],
"http_code": 200,
"error_msg": "",
"task_result": 1000,
"data_type": "list",
"spider_start_time": "2021-10-28 15:31:48.890",
"spider_end_time": "2021-10-28 15:33:17",
"task_params":
{
"province": "henan",
"step": "start",
"city": "郑州市",
"index": 1
},
"metadata":
{
"province": "henan",
"city": "郑州市",
"index": 1
},
"spider_name": "environmental_protection",
"spider_ip": "10.8.1.18"
}
```
## 爬虫运行环境
<!--udm模块?scrapy?或其他-->
......
Clone repository
  • README
  • basic_guidelines
  • basic_guidelines
    • basic_guidelines
    • dev_guide
    • project_build
    • 开发流程
  • best_practice
  • best_practice
    • AlterTable
    • RDS
    • azkaban
    • create_table
    • design
    • elasticsearch
    • elasticsearch
      • ES运维
    • logstash
View All Pages