Skip to content

Commit c0f3449

Browse files
GH-72, 77: Fix data inconsistencies and remove derived values across years for all datasets (#92)
* fix: standardize column names in various datasets * fix typo * fix: standardize purpose names across multiple datasets * fix: remove 'Total' column from age group datasets for consistency * fix: remove unnecessary columns from gender registration datasets for consistency * fix: remove 'Percentage(%)' column from foreign employment registration datasets for consistency * fix: remove 'Total' column from foreign employment registration dataset for consistency * fix: standardize column name 'No. of Detainees removed' to 'Number of Deportations' across multiple datasets for consistency * fix: remove 'Total' column from age group datasets for consistency * Remove "Total" column, row and update district names for consistency in SLBFE registration data by district, manpower level, and gender for 2020, 2021, 2022, and 2023 * fix: standardize column names for across multiple datasets for slbfe registrations by district, manpower and gender for consistency * fix inconsistencies columns in slbfe registration by manpower level * update metadata column counts for slbfe registrations by age group * update metadata column counts for slbfe registrations by gender * update metadate column counts and remove total row from slbfe registration by country datasets * updates metadata column counts for slbfe registrations by age and manpower * update metadata row and column counts for slbfe registrations by district, manpower level and gender * update 2024 slbfe manpower vs gender dataset column names to match the other years * normalise coulmn names and remove total values in slbfe registration by country vs manpower level, update metadata files * (Tourism) Normalize country names across arrivals by country datasets * (Toursim) Normalise country names across arrivals by month vs country datasets * update row counts in metadata for arrivals by month vs country * remove 'others' from top 10 source markets * (SLBFE) normalise country names across datasets for registration by country vs manpower level * fix typos and normalise nationality names across asylum seekers datasets * fix typos and normalise nationality names across deported foreign nationals datasets * fix typos and normalise nationality names across refugees by nationality datasets * fix typos and normalise nationality names across refused entry by nationality datasets * fix typos in nationalities * normalise place names for location vs revenue vs visitors count datasets * remove percentage value from workers remittances across the years * remove total value from arrivals by month * remove derived values for slbfe registration by all sources vs country and private remittances * normalise country names for slbfe registratiion by all sources vs country * remove total values from workers' remittances by country dataset * Update data/2019/Government of Sri Lanka(government)/Gotabaya Rajapaksa(citizen)/Minister of Telecommunication, Foreign Employment and Sports(minister)/Sri Lanka Foreign Employment Bureau(department)/slbfe_registration(AS_CATEGORY)/by_country(AS_CATEGORY)/SLBFE_registration_by_country/data.json Co-authored-by: gemini-code-assist[bot] <176961590+gemini-code-assist[bot]@users.noreply.github.com> * Update data/2020/Government of Sri Lanka(government)/Gotabaya Rajapaksa(citizen)/State Minister of Foreign Employment Promotion and Market Diversification(minister)/Sri Lanka Foreign Employment Bureau(department)/foreign_employment(AS_CATEGORY)/slbfe_registration(AS_CATEGORY)/by_country(AS_CATEGORY)/SLBFE registration by country/data.json Co-authored-by: gemini-code-assist[bot] <176961590+gemini-code-assist[bot]@users.noreply.github.com> --------- Co-authored-by: gemini-code-assist[bot] <176961590+gemini-code-assist[bot]@users.noreply.github.com>
1 parent ad001e6 commit c0f3449

140 files changed

Lines changed: 1266 additions & 1216 deletions

File tree

  • data
    • 2019/Government of Sri Lanka(government)/Gotabaya Rajapaksa(citizen)
      • Minister of Telecommunication, Foreign Employment and Sports(minister)/Sri Lanka Foreign Employment Bureau(department)
      • Minister of Tourism Development, Wildlife and Christian Religious Affairs(minister)/Sri Lanka Tourism Development Authority(department)
        • arrivals(AS_CATEGORY)
          • by_purpose(AS_CATEGORY)/tourist_arrivals_by_purpose_of_visit
          • by_sex(AS_CATEGORY)/tourist_arrivals_by_gender
          • month_vs_country(AS_CATEGORY)/tourist_arrivals_by_country_and_month
        • location_vs_revenue_vs_visitors_count(AS_CATEGORY)/Tourist Attractions Revenue and Visitors
        • occupancy_rate(AS_CATEGORY)
          • by_district(AS_CATEGORY)/occupancy_rates_by_district
          • by_month(AS_CATEGORY)/occupancy_rates_by_month
    • 2020/Government of Sri Lanka(government)/Gotabaya Rajapaksa(citizen)
      • Minister of Tourism and Civil Aviation(minister)/Sri Lanka Tourism Development Authority(department)/tourism(AS_CATEGORY)
        • arrivals(AS_CATEGORY)
        • location_vs_revenue_vs_visitors_count(AS_CATEGORY)/Location vs Revenue vs Visitors Count
      • State Minister of Foreign Employment Promotion and Market Diversification(minister)/Sri Lanka Foreign Employment Bureau(department)/foreign_employment(AS_CATEGORY)
      • State Minister of Internal Security, Home affairs and Disaster Management(minister)/Department of Immigration and Emigration(department)/immigration_and_emigration(AS_CATEGORY)
        • asylum_seekers(AS_CATEGORY)/asylum_seekers_by_nationality
        • deported_foreign_nationals(AS_CATEGORY)/deportations_by_nationality
        • refugees(AS_CATEGORY)/refugees_by_nationality
    • 2021/Government of Sri Lanka(government)/Gotabaya Rajapaksa(citizen)
      • Minister of Tourism(minister)/Sri Lanka Tourism Development Authority(department)/tourism(AS_CATEGORY)/arrivals(AS_CATEGORY)
      • State Minister of Foreign Employment Promotion and Market Diversification(minister)/Sri Lanka Foreign Employment Bureau(department)/foreign_employment(AS_CATEGORY)
      • State Minister of National Security, Home Affairs and Disaster Management(minister)/Department of Immigration and Emigration(department)/immigration_and_emigration(AS_CATEGORY)
        • asylum_seekers(AS_CATEGORY)/asylum_seekers_by_nationality
        • deported_foreign_nationals(AS_CATEGORY)/deportations_by_nationality
        • refugees(AS_CATEGORY)/refugees_by_nationality
        • refused_foreign_entry(AS_CATEGORY)/refused_entry_by_nationality
    • 2022/Government of Sri Lanka(government)/Ranil Wickremesinghe(citizen)
      • Minister of Investment Promotion(minister)/Department of Immigration and Emigration(department)/immigration_and_emigration(AS_CATEGORY)
        • deported_foreign_nationals(AS_CATEGORY)/deportations_by_nationality
        • refugees(AS_CATEGORY)/refugees_by_nationality
        • refused_foreign_entry(AS_CATEGORY)/refused_entry_by_nationality
      • Minister of Labour and Foreign Employment(minister)/Sri Lanka Foreign Employment Bureau(department)/foreign_employment(AS_CATEGORY)
      • Minister of Tourism and Lands(minister)/Sri Lanka Tourism Development Authority(department)/tourism(AS_CATEGORY)/arrivals(AS_CATEGORY)
    • 2023/Government of Sri Lanka(government)/Ranil Wickremesinghe(citizen)
      • Minister of Investment Planning(minister)/Department of Immigration and Emigration(department)/immigration_and_emigration(AS_CATEGORY)
        • asylum_seekers(AS_CATEGORY)/asylum_seekers_by_nationality
        • deported_foreign_nationals(AS_CATEGORY)/deportations_by_nationality
        • fake_passports(AS_CATEGORY)/fake_passport_detection_by_nationality
        • fraudulent_visa(AS_CATEGORY)/fraudulent_visa_detection_by_nationality
        • refugees(AS_CATEGORY)/refugees_by_nationality
        • refused_foreign_entry(AS_CATEGORY)/refused_entry_by_nationality
      • Minister of Labour and Foreign Employment(minister)/Sri Lanka Foreign Employment Bureau(department)/foreign_employment(AS_CATEGORY)
      • Minister of Tourism and Lands(minister)/Sri Lanka Tourism Development Authority(department)/tourism(AS_CATEGORY)
        • arrivals(AS_CATEGORY)
          • by_country(AS_CATEGORY)/Arrivals by Country
          • by_month(AS_CATEGORY)/Arrivals by Month
          • by_purpose(AS_CATEGORY)/Arrivals by Purpose
          • month_vs_country(AS_CATEGORY)/Arrivals by Month vs Country
        • location_vs_revenue_vs_visitors_count(AS_CATEGORY)/Location vs Revenue vs Visitor Count
    • 2024/Government of Sri Lanka(government)/Anura Kumara Dissanayake(citizen)
      • Minister of Foreign Affairs, Foreign Employment and Tourism(minister)
        • Sri Lanka Foreign Employment Bureau(department)/foreign_employment(AS_CATEGORY)/slbfe_registration(AS_CATEGORY)
        • Sri Lanka Tourism Development Authority(department)/tourism(AS_CATEGORY)
          • accommodations(AS_CATEGORY)/by_province(AS_CATEGORY)/Accommodations by Province
          • arrivals(AS_CATEGORY)
          • top_10_source_markets(AS_CATEGORY)/Top 10 source markets
      • Minister of Public Security and Parliamentary Affairs(minister)/Department of Immigration and Emigration(department)/immigration_and_emigration(AS_CATEGORY)
        • deported_foreign_nationals(AS_CATEGORY)/deportations_by_nationality
        • refugees(AS_CATEGORY)/refugees_by_nationality

Some content is hidden

Large Commits have some content hidden by default. Use the searchbox below for content that may be hidden.
Original file line numberDiff line numberDiff line change
@@ -1,62 +1,61 @@
11
{
2-
"columns": ["Country", "Number", "Percentage(%)"],
2+
"columns": ["Country", "Number"],
33
"rows": [
4-
["Saudi Arabia", 35415, 17.44],
5-
["Kuwait", 43073, 21.21],
6-
["U.A.E.", 328690, 16.18],
7-
["Qatar", 40783, 20.08],
8-
["Oman", 9016, 4.439],
9-
["South Korea", 6207, 3.056],
10-
["Maldives", 7767, 3.824],
11-
["Jordan", 4611, 2.27],
12-
["Bahrain", 3017, 1.486],
13-
["Lebanon", 1902, 0.937],
14-
["Malaysia", 3296, 1.623],
15-
["Israel", 1559, 0.768],
16-
["Cyprus", 2421, 1.192],
17-
["Singapore", 2124, 1.046],
18-
["Seychelles", 977, 0.481],
19-
["Bangladesh", 576, 0.284],
20-
["Hong Kong", 624, 0.307],
21-
["Mauritius", 91, 0.045],
22-
["Iraq", 183, 0.09],
23-
["Kurdistan", 156, 0.077],
24-
["India",244,0.12],
25-
["Afghanistan", 109, 0.054],
26-
["Japan", 1594, 0.785],
27-
["New Zealand", 460, 0.227],
28-
["Romania", 2312, 1.138],
29-
["Papua New Guinea", 69, 0.034],
30-
["Ethiopia", 296, 0.146],
31-
["Italy", 72, 0.035],
32-
["Kenya", 37, 0.018],
33-
["Uganda", 52, 0.026],
34-
["Pakistan", 38, 0.019],
35-
["Egypt", 68, 0.033],
36-
["Fiji", 30, 0.015],
37-
["Sudan", 17, 0.008],
38-
["Brunei", 17, 0.008],
39-
["Vietnam", 40, 0.02],
40-
["Australia", 40, 0.02],
41-
["Ireland",27,0.013],
42-
["Yemen", 2, 0.0003],
43-
["Libya", 0, 0.00],
44-
["United States", 15, 0.007],
45-
["China", 17, 0.008],
46-
["Greece", 28, 0.014],
47-
["Malta", 111, 0.055],
48-
["Myanmar",19,0.009],
49-
["Botswana", 19, 0.009],
50-
["Malawi",21,0.01],
51-
["Thailand", 31, 0.015],
52-
["South Africa", 8, 0.004],
53-
["United Kingdom", 25, 0.012],
54-
["Turkey", 68, 0.033],
55-
["Mozambique", 17, 0.008],
56-
["Lithuania", 73, 0.036],
57-
["Angola", 8, 0.004],
58-
["Djibouti", 10, 0.005],
59-
["Other", 435, 0.214],
60-
["Total", 203087, 100.00]
4+
["Saudi Arabia", 35415],
5+
["Kuwait", 43073],
6+
["U.A.E.", 328690],
7+
["Qatar", 40783],
8+
["Oman", 9016],
9+
["South Korea", 6207],
10+
["Maldives", 7767],
11+
["Jordan", 4611],
12+
["Bahrain", 3017],
13+
["Lebanon", 1902],
14+
["Malaysia", 3296],
15+
["Israel", 1559],
16+
["Cyprus", 2421],
17+
["Singapore", 2124],
18+
["Seychelles", 977],
19+
["Bangladesh", 576],
20+
["Hong Kong", 624],
21+
["Mauritius", 91],
22+
["Iraq", 183],
23+
["Kurdistan", 156],
24+
["India", 244],
25+
["Afghanistan", 109],
26+
["Japan", 1594],
27+
["New Zealand", 460],
28+
["Romania", 2312],
29+
["Papua New Guinea", 69],
30+
["Ethiopia", 296],
31+
["Italy", 72],
32+
["Kenya", 37],
33+
["Uganda", 52],
34+
["Pakistan", 38],
35+
["Egypt", 68],
36+
["Fiji", 30],
37+
["Sudan", 17],
38+
["Brunei", 17],
39+
["Vietnam", 40],
40+
["Australia", 40],
41+
["Ireland", 27],
42+
["Yemen", 2],
43+
["Libya", 0],
44+
["United States", 15],
45+
["China", 17],
46+
["Greece", 28],
47+
["Malta", 111],
48+
["Myanmar",19],
49+
["Botswana", 19],
50+
["Malawi",21],
51+
["Thailand", 31],
52+
["South Africa", 8],
53+
["United Kingdom", 25],
54+
["Turkey", 68],
55+
["Mozambique", 17],
56+
["Lithuania", 73],
57+
["Angola", 8],
58+
["Djibouti", 10],
59+
["Other", 435]
6160
]
6261
}

data/2019/Government of Sri Lanka(government)/Gotabaya Rajapaksa(citizen)/Minister of Telecommunication, Foreign Employment and Sports(minister)/Sri Lanka Foreign Employment Bureau(department)/slbfe_registration(AS_CATEGORY)/by_country(AS_CATEGORY)/SLBFE_registration_by_country/metadata.json

Lines changed: 2 additions & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -1,8 +1,8 @@
11
{
22
"storage_type": "tabular",
33
"dataset_name": "SLBFE registration by country",
4-
"column_count": 3,
5-
"row_count": 57,
4+
"column_count": 2,
5+
"row_count": 56,
66
"node_count": null,
77
"edges_count": null,
88
"resource": "https://www.slbfe.lk/wp-content/uploads/2023/09/Statistics-2020.pdf",
Original file line numberDiff line numberDiff line change
@@ -1,6 +1,6 @@
11
{
2-
"columns": ["Year","Male (Number)", "Male (%)", "Female (Number)", "Female (%)","Total"],
2+
"columns": ["Year","Male", "Female"],
33
"rows": [
4-
["2019",122257,60.20,80830,39.80,203087]
4+
["2019",122257,80830]
55
]
66
}

data/2019/Government of Sri Lanka(government)/Gotabaya Rajapaksa(citizen)/Minister of Telecommunication, Foreign Employment and Sports(minister)/Sri Lanka Foreign Employment Bureau(department)/slbfe_registration(AS_CATEGORY)/by_gender(AS_CATEGORY)/SLBFE_registration_by_gender/metadata.json

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -1,7 +1,7 @@
11
{
22
"storage_type": "tabular",
33
"dataset_name": "SLBFE registration by gender",
4-
"column_count": 6,
4+
"column_count": 3,
55
"row_count": 1,
66
"node_count": null,
77
"edges_count": null,
Original file line numberDiff line numberDiff line change
@@ -1,6 +1,24 @@
11
{
2-
"columns": ["Year", "Professional Level (Number)", "Professional Level (%)", "Middle Level (Number)", "Middle Level (%)", "Skilled (Number)", "Skilled (%)", "Clerical & Related (Number)", "Clerical & Related (%)", "Semi Skilled Domestic housekeeping assistants (Number)", "Semi-Skilled Others (Number)","Semi-Skilled Others (Total)", "Semi-Skilled Others (%)", "Low skilled (Number)", "Low skilled (%)", "Total"],
2+
"columns": [
3+
"Year",
4+
"Professional Level",
5+
"Middle Level",
6+
"Skilled",
7+
"Clerical & Related",
8+
"Semi Skilled Domestic housekeeping assistants",
9+
"Semi Skilled Others",
10+
"Low Skilled"
11+
],
312
"rows": [
4-
["2019", 9861, 4.85, 5725, 2.82, 62711, 30.88, 9163, 4.51, 61489, 2950, 64439, 31.73, 51188, 25.20, 203087]
13+
[
14+
"2019",
15+
9861,
16+
5725,
17+
62711,
18+
9163,
19+
61489,
20+
2950,
21+
51188
22+
]
523
]
624
}

data/2019/Government of Sri Lanka(government)/Gotabaya Rajapaksa(citizen)/Minister of Telecommunication, Foreign Employment and Sports(minister)/Sri Lanka Foreign Employment Bureau(department)/slbfe_registration(AS_CATEGORY)/by_manpower_level(AS_CATEGORY)/SLBFE_registration_by_manpower_level/metadata.json

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -1,7 +1,7 @@
11
{
22
"storage_type": "tabular",
33
"dataset_name": "SLBFE registration by manpower level",
4-
"column_count": 16,
4+
"column_count": 8,
55
"row_count": 1,
66
"node_count": null,
77
"edges_count": null,
Original file line numberDiff line numberDiff line change
@@ -1,6 +1,6 @@
11
{
2-
"columns": ["Year", "Remittances Middle East (Rs. Million)", "Remittances Middle East (US$ Million)", "Remittances Total (Rs. Million)", "Remittances Total (US$ Million)", "Middle East as a % of total remittance"],
2+
"columns": ["Year", "Remittances Middle East (Rs. Million)", "Remittances Middle East (US$ Million)", "Remittances Total (Rs. Million)", "Remittances Total (US$ Million)"],
33
"rows": [
4-
["2019", 618394, 3459, 1200766, 6717, 51.5]
4+
["2019", 618394, 3459, 1200766, 6717]
55
]
66
}

data/2019/Government of Sri Lanka(government)/Gotabaya Rajapaksa(citizen)/Minister of Telecommunication, Foreign Employment and Sports(minister)/Sri Lanka Foreign Employment Bureau(department)/workers_remittances(AS_CATEGORY)/metadata.json

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -1,7 +1,7 @@
11
{
22
"storage_type": "tabular",
33
"dataset_name": "Workers Remittances",
4-
"column_count": 6,
4+
"column_count": 5,
55
"row_count": 1,
66
"node_count": null,
77
"edges_count": null,

data/2019/Government of Sri Lanka(government)/Gotabaya Rajapaksa(citizen)/Minister of Tourism Development, Wildlife and Christian Religious Affairs(minister)/Sri Lanka Tourism Development Authority(department)/arrivals(AS_CATEGORY)/by_purpose(AS_CATEGORY)/tourist_arrivals_by_purpose_of_visit/data.json

Lines changed: 5 additions & 5 deletions
Original file line numberDiff line numberDiff line change
@@ -5,27 +5,27 @@
55
],
66
"rows": [
77
[
8-
"Pleasure / Vacation",
8+
"Pleasure/Vacation",
99
83.2
1010
],
1111
[
12-
"Visiting friends & relatives",
12+
"Visiting Friends & Relatives",
1313
10.48
1414
],
1515
[
1616
"Business",
1717
3.65
1818
],
1919
[
20-
"Convention & Meeting",
20+
"Convention/Meeting",
2121
0.99
2222
],
2323
[
2424
"Sports",
2525
0.72
2626
],
2727
[
28-
"Health / Ayurveda",
28+
"Health/Ayurveda",
2929
0.59
3030
],
3131
[
@@ -37,7 +37,7 @@
3737
0.3
3838
],
3939
[
40-
"Others",
40+
"Other/Not Responded",
4141
0.1
4242
],
4343
[

data/2019/Government of Sri Lanka(government)/Gotabaya Rajapaksa(citizen)/Minister of Tourism Development, Wildlife and Christian Religious Affairs(minister)/Sri Lanka Tourism Development Authority(department)/arrivals(AS_CATEGORY)/by_sex(AS_CATEGORY)/tourist_arrivals_by_gender/data.json

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -1,6 +1,6 @@
11
{
22
"columns": [
3-
"Gender",
3+
"Sex",
44
"Percentage"
55
],
66
"rows": [

0 commit comments

Comments
 (0)