
XML parser in Python (or an API) for contract information from the public procurement FTP



The task is to collect all information on contracts, as well as on all participants whose contracts were terminated, and on penalties and fines. The data is stored on the public procurement FTP in the folder fcs_regions -> region name -> folders starting with contracts. The data is stored in XML (in theory all of it; for some of it we have figured out where it is stored, for the rest this still has to be found). We need all the data for 2022.
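The folder layout described above can be sketched roughly as follows. This is only an illustration: the case-insensitive "contract" prefix (so that both contracts and ContractProcedureDocs match), the idea that archive names contain the year as "_2022...", and the helper names are assumptions, not something confirmed by the FTP itself.

```python
from ftplib import FTP

FTP_HOST = "ftp.zakupki.gov.ru"   # host taken from the task description
FTP_ROOT = "/fcs_regions"


def contract_dirs(region, names):
    """Return full paths of a region's sub-folders whose name starts with 'contract'.

    Assumption: the prefix check is case-insensitive, so both 'contracts'
    and 'ContractProcedureDocs' are picked up.
    """
    return [
        f"{FTP_ROOT}/{region}/{name}"
        for name in sorted(names)
        if name.lower().startswith("contract")
    ]


def archives_for_year(file_names, year="2022"):
    """Keep .zip archives whose name mentions the year.

    Assumption: archive names carry dates in the form '_2022...'.
    """
    return [
        n for n in file_names
        if n.lower().endswith(".zip") and f"_{year}" in n
    ]


def list_region(region, user="free", password="free"):
    """Connect to the FTP and list candidate contract folders for one region."""
    with FTP(FTP_HOST) as ftp:
        ftp.login(user, password)
        # nlst may return bare names or full paths, so keep the last segment only
        names = [p.rsplit("/", 1)[-1] for p in ftp.nlst(f"{FTP_ROOT}/{region}")]
    return contract_dirs(region, names)
```

Only list_region touches the network; the two filter functions can be checked offline against a saved directory listing.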

We need all contract-related data from the XML: penalties, fines, terminations, ALL stages, terms, name of the customer, etc. A daily update is required: the parser has to run from cron so that the data is kept up to date. The data should be loaded into MySQL tables split by district (there is a ready-made script for splitting by district, I will send it to the contractor). You can develop it locally for now; later it will need to be deployed on the server.

As a result, we should get a MySQL database, updated daily, with all the contract data available at the links given: customer data, contractor data, information about the contract, contract stages, terms, penalties (if any), and information about termination (if any).
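One possible way to make the daily run safe to repeat is an upsert, so that a re-parsed contract or stage updates its existing row instead of creating a duplicate or wiping earlier data. The sketch below only builds the statement. The table and column names (contract_stages, reg_num, stage_no, amount) are hypothetical, and it assumes a UNIQUE index on the key columns and a driver that uses %s placeholders (PyMySQL and mysql-connector do).

```python
def build_upsert(table, row, key_columns):
    """Build a MySQL 'INSERT ... ON DUPLICATE KEY UPDATE' statement.

    table and column names are interpolated into the SQL text, so they
    must come from your own code, never from parsed XML. Values are
    passed separately as parameters.
    """
    columns = list(row)
    placeholders = ", ".join(["%s"] * len(columns))
    updates = ", ".join(
        f"{c} = VALUES({c})" for c in columns if c not in key_columns
    )
    if not updates:
        # Every column is part of the key: use a no-op update
        updates = f"{key_columns[0]} = {key_columns[0]}"
    sql = (
        f"INSERT INTO {table} ({', '.join(columns)}) "
        f"VALUES ({placeholders}) "
        f"ON DUPLICATE KEY UPDATE {updates}"
    )
    return sql, tuple(row[c] for c in columns)


# Hypothetical usage with an open DB-API cursor:
#   sql, params = build_upsert("contract_stages", stage_row, ("reg_num", "stage_no"))
#   cursor.execute(sql, params)
#
# Hypothetical crontab entry for a daily run at 03:00:
#   0 3 * * * /usr/bin/python3 /path/to/update_contracts.py
```

Newer MySQL versions deprecate VALUES() in this position in favour of a row alias, so the exact form may need adjusting to the server version.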

The data is stored on the FTP in XML format. As an example, I can send a rough, partly broken contract parser.

Login details: ftp://ftp.zakupki.gov.ru/ login: free, password: free

I attach an example of what is in a region folder: an archive for 24-25.10 from the ContractProcedureDocs folder and one from the contracts folder.
That is, the data on the company whose contract was terminated is stored in the ContractProcedureDocs folder, and part of the contract data is in the contracts folder. Penalties and fines also seem to be there.
One option is to download the entire region and search the files for a fine or penalty by contract number. I will also send a parser for contract stages; it partly works. Its drawback is that we need all the stages of a contract, but on update the data sometimes gets erased.
The contract number is tied to the notice, and that information is in the contract itself.
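The "download the region and search by number" option could look roughly like the sketch below. The element names used here (regNum, penalties, termination) are placeholders for illustration only; the real names have to be taken from the actual XML files. Matching is done on local tag names so that XML namespaces do not get in the way.

```python
import zipfile
import xml.etree.ElementTree as ET


def local_name(tag):
    """Strip the '{namespace}' prefix that ElementTree puts on tags."""
    return tag.rsplit("}", 1)[-1]


def find_contract_docs(archive, reg_num, number_tag="regNum"):
    """Return (member_name, root) for every XML in the zip that mentions reg_num.

    archive may be a file path or a file-like object.
    number_tag is a hypothetical element name holding the contract number.
    """
    matches = []
    with zipfile.ZipFile(archive) as zf:
        for member in zf.namelist():
            if not member.lower().endswith(".xml"):
                continue
            try:
                root = ET.fromstring(zf.read(member))
            except ET.ParseError:
                continue  # skip malformed files instead of aborting the run
            if any(
                local_name(el.tag) == number_tag
                and (el.text or "").strip() == reg_num
                for el in root.iter()
            ):
                matches.append((member, root))
    return matches


def sections_present(root, section_names=("penalties", "termination")):
    """Report which of the (hypothetical) section elements occur in a document."""
    found = {local_name(el.tag) for el in root.iter()}
    return sorted(found.intersection(section_names))
```

Running find_contract_docs over the archives in both the contracts and ContractProcedureDocs folders for the same number would gather all documents belonging to one contract before anything is written to the database.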

The attached document contains a more complete technical specification with screenshots.

Preference will be given to those who have worked with the public procurement FTP and know its structure and data. Otherwise, it will be difficult and will take a long time to figure out.
02.11.2022 11:07


