This page describes the data format of the file that must be provided to Sopht so that it can compute web information.
| Version | Date | Author | Description |
|---|---|---|---|
| v1.0 | 13/03/2023 | Réda Pengam | First version of the document |
| v1.1 | 18/02/2025 | Fumo Aristide | Clarification of csv fields |
Character encoding: UTF-8 (without BOM)
The line separator is the character sequence CRLF: Carriage Return followed by Line Feed. The last line may or may not end with this separator.
The file must start with a "header" line listing the names of the fields defined below.
Columns are separated by the comma character (,).
Each value can be surrounded by double quotes ("). Quoting is mandatory if the value contains a comma. If the " character must appear inside a quoted value, it must be doubled ("").
For further details on the CSV format, see RFC 4180.
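The formatting rules above can be sketched with Python's standard `csv` module. This is a minimal illustration, not Sopht's own tooling: the function name `render_csv` and the column names `page_name` and `comment` are hypothetical placeholders.

```python
import csv
import io


def render_csv(header, rows):
    """Serialize rows as UTF-8 bytes (no BOM), comma-separated, CRLF line endings."""
    buffer = io.StringIO(newline="")  # newline="" leaves the writer's CRLF untouched
    writer = csv.writer(
        buffer,
        delimiter=",",
        quotechar='"',
        doublequote=True,           # a " inside a quoted value is written as ""
        quoting=csv.QUOTE_MINIMAL,  # quote only values that need it
        lineterminator="\r\n",
    )
    writer.writerow(header)
    writer.writerows(rows)
    # The "utf-8" codec writes no BOM (unlike "utf-8-sig").
    return buffer.getvalue().encode("utf-8")


# Hypothetical column names and values, for illustration only.
data = render_csv(["page_name", "comment"], [["Home, main", 'Said "hello"']])
```

The value containing a comma and the value containing a double quote are both quoted automatically, and the inner double quotes are doubled.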
The file name must be in the form: web_information_YYYYMMDD.csv (e.g. "web_information_20241201.csv").
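As a sketch of the naming rule, the following helpers build a file name from a date and check that a given name matches the expected form. The helper names `build_file_name` and `is_valid_file_name` are hypothetical, and rejecting impossible dates (such as month 13) is an assumption rather than a documented Sopht check.

```python
import re
from datetime import date, datetime

# web_information_ followed by exactly eight digits (YYYYMMDD) and .csv
FILE_NAME_PATTERN = re.compile(r"web_information_([0-9]{8})\.csv")


def build_file_name(day):
    """Return the file name for the given date."""
    return f"web_information_{day:%Y%m%d}.csv"


def is_valid_file_name(name):
    """Check the overall form, then that the eight digits are a real calendar date."""
    match = FILE_NAME_PATTERN.fullmatch(name)
    if match is None:
        return False
    try:
        datetime.strptime(match.group(1), "%Y%m%d")
    except ValueError:
        return False
    return True
```

For example, `build_file_name(date(2024, 12, 1))` produces the name used in the example above.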
Sopht processes data only once: the values for a specific date cannot be modified once the first file containing those data has been shared.
You can remove optional columns altogether or change the order of the columns, but the column names must remain unchanged.
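Because columns are identified by name rather than by position, a file can be checked before sending by reading it by header name. The sketch below assumes a hypothetical helper `read_rows` and hypothetical column names; the actual required and optional fields are the ones defined in this documentation.

```python
import csv
import io


def read_rows(data, required, optional=()):
    """Parse CSV bytes by column name, independent of column order."""
    text = data.decode("utf-8")
    reader = csv.DictReader(io.StringIO(text, newline=""))
    columns = reader.fieldnames or []
    missing = [name for name in required if name not in columns]
    if missing:
        raise ValueError(f"missing required columns: {missing}")
    wanted = list(required) + list(optional)
    # Optional columns absent from the file are returned as None.
    return [{name: row.get(name) for name in wanted} for row in reader]


# Hypothetical columns: "comment" comes first and "visits" is omitted entirely.
sample = b"comment,page_name\r\nhello,Home\r\n"
rows = read_rows(sample, required=["page_name"], optional=["comment", "visits"])
```

Here the reordered columns are matched by name, and the omitted optional column simply has no value.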