Start Date
11-8-2016
Description
The text sections of the SEC mandated annual reports abound with important corporate operational information but they are hard to manipulate in bulk because of the varying formats used by the submitting companies. Researchers and private entities have demonstrated the difficulties inherent in extracting and accumulating certain textual portions of these reports. This paper proposes an XML schema that will follow a specific DTD for the 10-K (and 10-Q) reports. Using simple computer commands, the ease of manipulation of the reports text sections is demonstrated.
An SEC 10-K XML Schema Extension to Extract Cyber Security Risks
The text sections of the SEC mandated annual reports abound with important corporate operational information but they are hard to manipulate in bulk because of the varying formats used by the submitting companies. Researchers and private entities have demonstrated the difficulties inherent in extracting and accumulating certain textual portions of these reports. This paper proposes an XML schema that will follow a specific DTD for the 10-K (and 10-Q) reports. Using simple computer commands, the ease of manipulation of the reports text sections is demonstrated.