Start Date

11-8-2016

Description

The text sections of the SEC mandated annual reports abound with important corporate operational information but they are hard to manipulate in bulk because of the varying formats used by the submitting companies. Researchers and private entities have demonstrated the difficulties inherent in extracting and accumulating certain textual portions of these reports. This paper proposes an XML schema that will follow a specific DTD for the 10-K (and 10-Q) reports. Using simple computer commands, the ease of manipulation of the reports text sections is demonstrated.

Share

COinS
 
Aug 11th, 12:00 AM

An SEC 10-K XML Schema Extension to Extract Cyber Security Risks

The text sections of the SEC mandated annual reports abound with important corporate operational information but they are hard to manipulate in bulk because of the varying formats used by the submitting companies. Researchers and private entities have demonstrated the difficulties inherent in extracting and accumulating certain textual portions of these reports. This paper proposes an XML schema that will follow a specific DTD for the 10-K (and 10-Q) reports. Using simple computer commands, the ease of manipulation of the reports text sections is demonstrated.