Arthur de Jong

Open Source / Free Software developer

path: root/update
Commit message | Author | Age | Files | Lines
* Update database files | Arthur de Jong | 2024-03-17 | 1 | -8/+14
  The Belgian bpost bank no longer has a registration and a few bank account numbers in the tests that used that bank were removed. Also updates the update/gs1_ai.py script to handle the new format of the data published by GS1. Also update the GS1-128 module to handle some different date formats. The Pakistan entry was kept in the stdnum/iban.dat file because the PDF version of the IBAN Registry still contains the country.
  fix db
* Switch to using openpyxl for parsing XLSX files | Arthur de Jong | 2024-03-17 | 3 | -20/+18
  The xlrd package has dropped support for parsing XLSX files. We still use xlrd for update/be_banks.py because they use the classic XLS format and openpyxl does not support that format.
* Update database files | Arthur de Jong | 2023-08-20 | 1 | -4/+8
  This also modifies the OUI update script because the website has changed to HTTPS and is sometimes very slow. The Belgian Commerzbank no longer has a registration and a bank account number in the tests used that bank.
* Remove EU NACE update script | Arthur de Jong | 2023-08-20 | 1 | -66/+0
  The website that publishes the NACE catalogue has changed and a complete re-write of the script would be necessary. The data file hasn't changed since 2017 so is also unlikely to change until it is going to be replaced by NACE rev. 2.1 in 2025.
  See https://ec.europa.eu/eurostat/web/nace
  The NACE rev 2 specification can now be found here:
  https://showvoc.op.europa.eu/#/datasets/ESTAT_Statistical_Classification_of_Economic_Activities_in_the_European_Community_Rev._2/data
  The NACE rev 2.1 specification can now be found here:
  https://showvoc.op.europa.eu/#/datasets/ESTAT_Statistical_Classification_of_Economic_Activities_in_the_European_Community_Rev._2.1._%28NACE_2.1%29/data
  In both cases a ZIP file with RDF metadata can be downloaded (but the web application also exposes some simpler JSON APIs).
* Extend date parsing in GS1-128 | Arthur de Jong | 2023-03-18 | 1 | -2/+2
  Some new AIs have new date formats or have changed the way optional components of formats are defined.
* Increase timeout for CN Open Data download | Arthur de Jong | 2022-11-13 | 1 | -2/+2
  It seems that raw.githubusercontent.com can be extremely slow.
* Pick up custom certificate from script path | Arthur de Jong | 2022-11-13 | 1 | -2/+4
  This ensures that the script can be run from any directory.
  Fixes c4ad714
* Ensure we always run flake8-bugbear | Arthur de Jong | 2022-11-13 | 1 | -1/+1
  This assumes that we no longer use Python 2.7 for running the flake8 tests.
* Do not print trailing space | Arthur de Jong | 2022-08-15 | 1 | -1/+1
* Ignore invalid downloaded country codes | Arthur de Jong | 2022-08-15 | 1 | -8/+9
  The page currently lists a country without a country code (it is listed as "-"). This also ensures that lists of country codes are handled consistently.
* Update names of Wikipedia pages with IMSI codes | Arthur de Jong | 2022-08-15 | 1 | -7/+7
* Provide a timeout to all download scripts | Arthur de Jong | 2022-08-15 | 15 | -17/+17
* Update ISIL download URL | Arthur de Jong | 2022-08-15 | 1 | -1/+1
* Add Czech bank account numbers | petr.prikryl | 2022-08-14 | 1 | -0/+64
  Closes https://github.com/arthurdejong/python-stdnum/issues/295
  Closes https://github.com/arthurdejong/python-stdnum/pull/296
* Put long line flake8 ignores in files instead of globally | Arthur de Jong | 2022-08-13 | 1 | -1/+1
  We have some long URLs in the code (mostly in docstrings) and wrapping them does not improve readability (and is difficult in docstrings) so the E501 ignore is now put inside each file instead of globally.
  Closes https://github.com/arthurdejong/python-stdnum/pull/302
* Fix flake8 error | Arthur de Jong | 2022-08-12 | 1 | -1/+1
  This stops using not as a function and hopefully also makes the logic clearer.
* Add Classification of Financial Instruments | Arthur de Jong | 2022-02-13 | 1 | -0/+98
  This implements parsing of ISO 10962 CFI codes based on the published description of the structure of these numbers.
  Closes https://github.com/arthurdejong/python-stdnum/issues/283
* Rename README to README.md | Arthur de Jong | 2021-10-03 | 1 | -2/+2
  Mostly to please GitHub.
  See https://github.com/arthurdejong/python-stdnum/issues/280
* Update database files | Arthur de Jong | 2021-10-03 | 1 | -2/+9
* Fix typos found by codespell | Dimitri Papadopoulos | 2021-08-10 | 4 | -5/+5
  Closes https://github.com/arthurdejong/python-stdnum/pull/269
* Fix handling of empty worksheet in Belgian bank download | Dimitri Papadopoulos | 2021-08-10 | 1 | -2/+5
  LGTM alert: Call to next() in a generator
  See: https://lgtm.com/rules/11000086/
  Closes https://github.com/arthurdejong/python-stdnum/pull/270
* Stop non-operational MNCs from confusing IMSI dataset | Arthur de Jong | 2021-04-11 | 1 | -18/+27
  This only includes data from non-operational (status "Not operational" according to Wikipedia) Mobile Network Code operators in the generated data file if they would not confuse the lookup of operational numbers. This avoids problems with the "030" to "039" non-operational ranges conflicting with the "03" operational range. This ensures that only the "03" value is kept. For historical completeness we keep the other non-operational values.
  Closes https://github.com/arthurdejong/python-stdnum/issues/257
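  The filtering rule described in this commit can be sketched roughly as follows. This is an illustration only, not the code from the update script: the function name `filter_mncs` and the `(mnc, operational)` tuple layout are assumptions made for the example.

```python
def filter_mncs(entries):
    """Drop non-operational MNCs that overlap with an operational MNC.

    entries is an iterable of (mnc, operational) tuples for a single MCC.
    The tuple layout is hypothetical and chosen for this sketch.
    """
    entries = list(entries)
    operational = {mnc for mnc, is_op in entries if is_op}

    def conflicts(mnc):
        # Two codes conflict when one is a prefix of the other, because
        # a prefix lookup could then no longer tell them apart.
        return any(
            mnc.startswith(op) or op.startswith(mnc)
            for op in operational)

    # Operational entries are always kept; non-operational entries are
    # kept only when they cannot confuse the lookup.
    return [
        (mnc, is_op) for mnc, is_op in entries
        if is_op or not conflicts(mnc)]


entries = [('03', True), ('030', False), ('039', False), ('07', False)]
kept = filter_mncs(entries)
```

  With this input only the operational "03" entry and the unrelated non-operational "07" entry remain in `kept`.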
* Switch postal code download to Austrian open-data portal | Arthur de Jong | 2021-01-24 | 1 | -62/+13
  This simplifies the process of downloading Austrian postal codes by downloading a JSON blob instead from https://www.data.gv.at/katalog/dataset/f76ed887-00d6-450f-a158-9f8b1cbbeebf
  This filters the list to only use addressable (adressierbar) postal codes because it matches the previous list. Thanks Bernd Schlapsi for providing the pointer.
  Closes https://github.com/arthurdejong/python-stdnum/issues/235
* Update database files | Arthur de Jong | 2021-01-11 | 3 | -10/+24
* Update database files | Arthur de Jong | 2020-08-09 | 2 | -2/+11
* Fix Python 3.6 issue in update script | Arthur de Jong | 2020-08-08 | 1 | -2/+2
  Fixes 180788a
* Add GS1-128 format | Arthur de Jong | 2020-08-08 | 1 | -0/+82
  This adds validation, parsing and encoding functions for GS1-128. It is based on the lists of formats as published by the GS1 organisation. Based on the implementation provided by Sergi Almacellas Abellana <sergi@koolpi.com>.
  Closes https://github.com/arthurdejong/python-stdnum/pull/144
* Update database files | Arthur de Jong | 2020-01-18 | 1 | -1/+1
* Parse multiple Wikipedia pages for full MCC/MNC list | Arthur de Jong | 2019-10-27 | 1 | -33/+56
* Make the IEEE OUI data more compact | Arthur de Jong | 2019-10-27 | 1 | -8/+30
  This groups consecutive assignments into a range to make the dat file a little more readable.
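  Grouping consecutive assignments into ranges can be sketched as below. The names `group_assignments` and `format_entry`, the `(prefix, name)` input layout and the printed line format are all assumptions made for this example; they are not taken from the update script or from the dat file format.

```python
def group_assignments(assignments):
    """Merge consecutive prefixes that share a name.

    assignments is an iterable of (prefix, name) tuples with integer
    prefixes. Returns a list of (first, last, name) tuples.
    """
    result = []
    for prefix, name in sorted(assignments):
        if result and result[-1][2] == name and result[-1][1] + 1 == prefix:
            # Extend the current range by one value.
            first = result[-1][0]
            result[-1] = (first, prefix, name)
        else:
            # Start a new single-value range.
            result.append((prefix, prefix, name))
    return result


def format_entry(first, last, name):
    """Render a range as hex (the line format is illustrative only)."""
    if first == last:
        return '%06X %s' % (first, name)
    return '%06X-%06X %s' % (first, last, name)


ranges = group_assignments([
    (0x000001, 'A'), (0x000002, 'A'), (0x000003, 'A'),
    (0x000005, 'A'), (0x000006, 'B')])
lines = [format_entry(*entry) for entry in ranges]
```

  Here the three consecutive values for "A" collapse into one range, while the gap before 0x000005 and the change of name at 0x000006 each start a new entry.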
* Switch update scripts to Python 3 | Arthur de Jong | 2019-10-27 | 11 | -42/+29
* Switch update scripts to use requests | Arthur de Jong | 2019-10-27 | 10 | -64/+66
  This makes the scripts more consistent.
* Switch update scripts to lxml | Arthur de Jong | 2019-10-27 | 6 | -77/+64
  This avoids an extra dependency on Beautiful Soup and makes the scripts more consistent. This also includes a fix in the ISIL because of website changes.
* Work around incorrect jpn.gov.my certificate chain | Arthur de Jong | 2019-10-27 | 2 | -4/+59
  The intermediate certificate for jpn.gov.my is missing from the certificate chain that is returned by the server since the server switched to HTTPS.
* Fix New Zealand Bank Branch Register update script | Arthur de Jong | 2019-10-27 | 1 | -32/+6
  There is now a direct URL for the XLS file and there is no longer a need to search the page for a link.
* Switch update scripts to beautifulsoup4 | Arthur de Jong | 2019-03-10 | 4 | -10/+23
* Add New Zealand bank account number | Arthur de Jong | 2019-03-10 | 1 | -0/+116
* Switch from import-order to isort | Arthur de Jong | 2019-03-03 | 4 | -9/+10
* Fix typos | Arthur de Jong | 2019-02-17 | 1 | -1/+1
* Add MAC address | Arthur de Jong | 2018-12-29 | 1 | -0/+70
  This adds validation of MAC (Ethernet) addresses. It will perform lookups in the IEEE registry for manufacturer names for universally administered addresses.
  Closes https://github.com/arthurdejong/python-stdnum/issues/93
* Get files ready for 1.10 release (tag: 1.10) | Arthur de Jong | 2018-10-14 | 1 | -7/+13
* Get files ready for 1.9 release (tag: 1.9) | Arthur de Jong | 2018-04-14 | 1 | -2/+4
* Correctly split IMSI with multi-length MNC | Arthur de Jong | 2018-04-10 | 1 | -2/+3
  Ensures that imsi.split() will correctly split the IMSI into an MCC, MNC and MSIN even if not all MNCs within a single MCC have the same length. This has the downside of not being able to guess the MNC length in some cases. This also omits empty information from the data file and updates the data file from Wikipedia.
  Closes https://github.com/arthurdejong/python-stdnum/issues/68
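  The idea of splitting an IMSI when MNC lengths vary within one MCC can be sketched as follows. This is a standalone illustration and not the implementation of imsi.split(): the function `split_imsi`, the lookup table layout and the MNC values in it are made up for the example.

```python
def split_imsi(number, mncs_by_mcc):
    """Split an IMSI into (MCC, MNC, MSIN) using a table of known MNCs.

    mncs_by_mcc maps a 3-digit MCC to a set of known MNC strings, which
    may be 2 or 3 digits long. The table layout is hypothetical.
    """
    mcc = number[:3]
    known = mncs_by_mcc.get(mcc, ())
    # Try both possible MNC lengths instead of assuming one length per MCC.
    for length in (2, 3):
        candidate = number[3:3 + length]
        if candidate in known:
            return mcc, candidate, number[3 + length:]
    # The MNC is unknown, so its length cannot be guessed: return the
    # remainder unsplit. This is the downside the commit mentions.
    return mcc, None, number[3:]


# Illustrative table only; these MNC values are not real data.
table = {'405': {'01', '025'}}
two_digit = split_imsi('405011234567890', table)
three_digit = split_imsi('405025123456789', table)
unknown = split_imsi('405991234567890', table)
```

  The sketch relies on the table holding no MNC that is a prefix of another MNC under the same MCC; otherwise the order in which lengths are tried would decide the result.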
* Add Austrian postal code | Arthur de Jong | 2018-03-17 | 1 | -0/+102
  This also fixes the name of the Dutch postal_code alias and tunes the tests.
* Update the flake8 ignore list | Arthur de Jong | 2018-02-14 | 2 | -3/+0
  Re-enable the flake8 test for unused imports by explicitly marking imports for namespace purposes. This allows us to remove a few unused imports. A few more cleanups that allow us to reduce the number of ignored flake8 tests. The remaining ignored tests are now documented. Ignore a flake8 warning about print statements because we use print in the update scripts.
* Add support for Belgian IBAN format | Arthur de Jong | 2018-02-10 | 2 | -0/+94
  This adds further checks to the country-specific part of the IBAN for Belgian IBANs.
  Closes https://github.com/arthurdejong/python-stdnum/issues/62
* Get files ready for 1.8 release (tag: 1.8) | Arthur de Jong | 2018-01-03 | 1 | -1/+7
* Move update scripts to own directory | Arthur de Jong | 2018-01-01 | 11 | -0/+842
  This moves all the update scripts to their own directory so they don't clutter the top-level directory. This also ensures that the scripts are passed through flake8 and makes some adjustments for that alongside a few other cleanups.