129 questions
0
votes
0
answers
51
views
More than one 10-Q?
Sorry if this is a stupid question; I am new to using Python for data extraction.
When I extract data from EDGAR, there is something that troubles me.
When I fetch 10-Q reports, several similar but not ...
0
votes
1
answer
51
views
Why am I getting an error message when searching for certain facts in 10-Q forms?
I'm fairly new to Python and would really appreciate any help I can get. I apologise if I call something by the wrong name. I am trying to build a tool using Edgar APIs that grabs info from financial ...
2
votes
0
answers
64
views
Cannot get RSS feed using tidyRSS
I have been using the following R code in RStudio for several months, and it worked on the first run without needing a while loop.
It takes an RSS feed from EDGAR.
feed_url = c("https://...
0
votes
0
answers
126
views
How to declare user-agent in edgar?
I'm trying to figure out how to fix some error messages in R while using the edgar package to download filings, following this post.
Code:
ua <- "Name [email protected]" #declare user-agent
...
0
votes
1
answer
74
views
How to ignore or bypass instances that result in NoneType non-iterable object
I am trying to parse a section from a 10-K in the EDGAR database. When I run the following code,
# pip install edgartools
import pandas as pd
from edgar import *
# Tell the SEC who you are
set_identity("...
-1
votes
3
answers
166
views
Iterating over an object in Python
I am new to Python. I am trying to parse some 10-Ks from EDGAR using the edgartools and sec-parsers Python modules. Here is my code -
import pandas as pd
# pip install edgartools
from edgar import *
# ...
3
votes
0
answers
258
views
How to organise all the data from the SEC.gov EDGAR API into income statement and balance sheet
My question in a nutshell:
I am trying to build my own financial data collection Python script for stock investing, to download company filings data from SEC.gov. I have developed my script to ...
1
vote
1
answer
101
views
Discrepancy in EDGAR data
My question is about the source of truth regarding EDGAR.
I am developing a simple service for myself to choose stocks based on fundamental data. Therefore I went to the EDGAR site and downloaded all ...
0
votes
0
answers
114
views
Parse SEC EDGAR XML file using XML and xml2 in R
I have a simple script that downloads a file from the SEC's EDGAR and parses the XML file, but something does not work.
library(xml2)
library(XML)
download.file("https://www.sec.gov/Archives/edgar/data/1026144/...
0
votes
2
answers
384
views
Extracting PDFs from the EDGAR database
Trying to extract the information from the PDF located here: https://www.sec.gov/Archives/edgar/data/784028/000078402823000002
However, even though this is an open API, it appears that the SEC blocks ...
0
votes
0
answers
75
views
Is it possible to recreate tables from a 10-Q document using linkbases and facts?
For reference, I am testing on this Netflix 10-Q.
Using just the presentation linkbase, I am able to reconstruct what the tables should look like. Tabs indicate indents. This is the statement of ...
-1
votes
1
answer
165
views
How do I access a custom tag in a company's 10-Q filing?
My issue concerns how to access non-us-gaap tagged company data in, for example, a 10-Q for the ticker MSFT. See this 10-Q filing: https://www.sec.gov/ix?doc=/Archives/edgar/data/789019/...
0
votes
1
answer
203
views
How to get S-1 form from SEC using python? [closed]
I am trying to get S-1 balance sheets, income statements, etc. using Python. I've used a lot of packages (sec_api, edgar and others) trying to get the data, but I haven't been able to get it.
Could ...
0
votes
1
answer
646
views
SEC filings data using API in python with filter for specific strings
I'm trying to extract SEC filings in a certain date range with a certain key string, for several different companies.
Basically I'm trying to replicate the search in this link:
SEC Edgar (https://www....
0
votes
0
answers
75
views
Scraping full EDGAR into HTML fails due to condition has length > 1
I always get the following error message even though, as far as I can tell, I am doing everything as described in the R documentation.
> output <- getFilingsHTML(cik.no = 789019, 'ALL', 2018, quarter = c(1,2,3,4), ...
0
votes
1
answer
43
views
Finding string or empty column in pandas
This is to create a general use case for cleaning up tabular data from the SEC EDGAR database.
Considering the table below, I need to remove columns that may contain '[any_int]'
Column A
Column B
...
0
votes
1
answer
201
views
Why is the purrr::map() function not working with load() to import Rda datasets
I have downloaded some Rdata files using the getMasterIndex function from the edgar package.
Now I am trying to load all of these files into RStudio using the following code -
paths <- list.files('Master ...
0
votes
1
answer
650
views
EDGAR RESTful API dei taxonomy
I'm trying to write a Microsoft Excel UDF to pull information from the U.S. SEC's EDGAR database. The documentation I can find says that I can pull data (for example) this way, and I'll get a JSON ...
0
votes
0
answers
35
views
Why is some of the text I extracted not properly decoded in Python? [duplicate]
I have written the following code to download the text of a financial report of Apple on the SEC:
headers = {'User-Agent' : 'email'}
response = requests.get('https://www.sec.gov/Archives/edgar/data/...
1
vote
0
answers
178
views
How to get 10-K filings for the Russell 3000 constituents using sec-api?
Is it possible to fetch the 10-K filings using something like the following?
from sec_api import QueryApi
query_api = QueryApi(api_key='YOUR_API_KEY')
# Define search parameters
index_ticker = 'RUA' ...
0
votes
1
answer
64
views
Delete txt file based on keywords from multiple folders
I downloaded a bunch of 10-Ks from EDGAR. I need to keep only the 10-K reports that contain the keywords "cryptocurrency" and "blockchain". Each company has one single folder. However, I was stuck ...
2
votes
2
answers
2k
views
Parse SEC EDGAR XML Form Data with child nodes using BeautifulSoup
I am attempting to scrape individual fund holdings from the SEC's N-PORT-P/A form using beautiful soup and xml. A typical submission, outlined below and [linked here][1], looks like:
<...
1
vote
2
answers
151
views
Web scraping for multiple classes using python
I am trying to scrape the address from a 10-K filing document in HTML: https://www.sec.gov/Archives/edgar/data/1652044/000165204419000032/goog10-qq32019.htm
It has multiple div classes, and I want to scrape ...
0
votes
1
answer
797
views
Extract 10K filings url for a company using CIK number python
I am working on a project to find the URL of the latest 10-K filing for a company using its CIK number. Please find the code below:
import requests
from bs4 import BeautifulSoup
# CIK number for Apple is ...
3
votes
1
answer
3k
views
Accessing filing data from SEC EDGAR API in python
I want to pull reports from the SEC EDGAR API and conduct analysis within python. From what I can tell, it looks like the main issue is that I'm using the wrong file format, but methods I have found ...
0
votes
1
answer
748
views
SEC EDGAR API to Google Sheets integration
I want to connect the US Securities and Exchange Commission (SEC) API to Google Sheets. I want to be able to upload JSON files from the SEC server to Google Sheets directly. But I could not make any of ...
1
vote
2
answers
10k
views
Downloading all 10-k filings for SEC EDGAR in python
I have currently managed to scrape all filings for a specific ticker eg. 'AAPL' and every type of filing with its link is presented in a massive dictionary. I would like only those links where the '...
2
votes
3
answers
1k
views
Inconsistent tags between XBRL files from the SEC (EDGAR)
I'm parsing every XBRL file from the SEC through EDGAR in order to retrieve some data (in JSON format, using Python).
I have no problem parsing those files. My problem lies in the structure of the XBRL ...
0
votes
1
answer
122
views
How can I mitigate encoding error that happens irregularly while scraping data?
I was scraping SEC EDGAR data by running 'MDA Extractor.py' in this link.
https://github.com/rflugum/10K-MDA-Section
As this program was made in Python2, I changed some expressions (e.g., print -> ...
0
votes
1
answer
876
views
Downloading file from the website - HTTPError: HTTP Error 403: Forbidden
I am trying to download 10Ks (annual report of public companies) from EDGAR. I am running the code below (used it from the textbook, don't understand much of it), but keep getting the following error: ...
4
votes
3
answers
3k
views
How to get data from SEC EDGAR with Python and JSON
On the following page, a JSON link is listed as the data source:
https://www.sec.gov/edgar/browse/?CIK=1067983&owner=exclude
Data source: CIK0001067983.json -> https://data.sec.gov/...
-1
votes
2
answers
253
views
What does the `end` property mean in the XBRL data?
I'm trying to read some programmatic API data from EDGAR, and I'm having trouble understanding the end property. For example, when looking at AssetsCurrent:
"end": "2018-03-31",
"...
0
votes
0
answers
174
views
Save (pre-2012) 13-F filings to Pandas Dataframe using python
I saw the question on pre-2013 13-F filings, but noticed that an even different format was used before 2012. This is the original question: Extracting table of holdings from (Edgar 13-F filings) TXT (pre-...
1
vote
3
answers
3k
views
Extract entire textual data from Edgar 10-K using python
I am trying to extract the entire textual data from the URL given below as an example. I have many URLs, so I am automating this. I tried every code snippet posted here, but they all give errors, e.g. AttributeError: 'NoneType' ...
2
votes
3
answers
650
views
Syncing one company's financial year to another company's using the EDGAR database
I'm working on a school project that was built by a previous group, and one of my tasks is to synchronize the fiscal year's start date according to the start date of the company of focus.
This is the ...
0
votes
2
answers
1k
views
How to create user agent?
I'm trying out this function from the package edgarwebR
x <- paste0("https://www.sec.gov/Archives/edgar/data/",
"933691/000119312517247698/0001193125-17-247698-index.htm"...
1
vote
2
answers
672
views
Can ExtractorApi in sec-api module be used for 10-Q filings?
I am trying to extract specific sections from the 10-Q report using ExtractorApi from sec-api module. The module works for 10-K, however, it fails with certain sections for the 10-Q. For example, if I ...
0
votes
1
answer
215
views
How to build up edgar filing calculations when a fact is summed up multiple ways?
The Edgar documentation has some limited information on how to handle facts with different dimension break-downs. Take as an example the AAPL annual report:
On page 29 the total Net Sales (365,817) is ...
2
votes
1
answer
990
views
Build financial table from EDGAR XBRL files
When companies submit their reports to SEC, a number of files are made available. E.g. the latest 10-K of AAPL.
From these files, even the SEC website (and many others) builds the tables as structured data.
...
0
votes
2
answers
749
views
Download a txt file from EDGAR
I want to download this file to my local drive:
https://www.sec.gov/Archives/edgar/data/1556179/0001104659-20-000861.txt
Here is my code:
import requests
import urllib
from bs4 import BeautifulSoup
...
1
vote
1
answer
587
views
How should I scrape an idx file on EDGAR?
I have an idx file:
https://www.sec.gov/Archives/edgar/daily-index/2020/QTR4/master.20201231.idx
I could open the idx file with the following code a year ago, but the code doesn't work now. Why is that? ...
3
votes
2
answers
647
views
JSONDecodeError: Expecting value: line 1 column 1 (char 0) when scraping SEC EDGAR
My code is as follows:
import requests
import urllib
from bs4 import BeautifulSoup
year_url = r"https://www.sec.gov/Archives/edgar/daily-index/2020/index.json"
year_content = requests.get(...
0
votes
2
answers
1k
views
R: reading old 13F txt files from SEC Edgar database using R edgar package
Hi, I'm trying to read the 13F filings in the SEC EDGAR database using the R edgar package.
The challenge is that the filings I'm looking at are old (around the year 2000).
https://www.sec.gov/edgar/...
-1
votes
1
answer
374
views
BeautifulSoup: Get the HTML Code of Modal Footer
I'm new to web scraping in Python and am trying to scrape all htm document links from an SEC EDGAR full-text search. I can see the link in the Modal Footer, but BeautifulSoup won't parse the href Element ...
2
votes
2
answers
7k
views
How to web scrape SEC EDGAR 10-K dynamic data
We are trying to parse an SEC EDGAR filing using Python.
I'm trying to get this table "Sales By Segment Of Business" at line 21 .
This is the link to the document.
https://www.sec.gov/ix?doc=...
1
vote
1
answer
1k
views
Python Edgar asks for useragent
I ran the following command:
import edgar
import pandas as pd
edgar.download_index('/Users/myusername/Desktop/Desktop', 2010,skip_all_present_except_last=False)
It's throwing the following error ...
0
votes
0
answers
255
views
Problems with getMgmtDisc (from edgar): "no filing information found" or "object ´words.count´ not found"
I am unfortunately working with R for the first time and I am having a problem with the package "edgar".
My code is as follows:
Normal:
MgmtDisc <- getMgmtDisc(cik.no = c(1390777, 1077771,...
0
votes
2
answers
8k
views
Downloading files from sec.gov via EDGAR using Python 3.9
I am new to the world of coding, so please bear with me if I misuse terminology or generally do not know what I am talking about. I am doing a research project in which I am trying to scrape public ...
1
vote
0
answers
166
views
Download multiple 10-K documents
I need to download multiple 10-K documents. However, this code only works fine when I download the 10-Ks for 5-10 companies, not when I increase the number of companies in [cik_lookup function]. Here's ...
1
vote
1
answer
629
views
Retrieving S-1 filings from EDGAR based on SIC using R
I am trying to analyse the S-1 filings of all Special Purpose Acquisition Companies (SIC=6770) but I am having trouble finding a way of getting this data from SEC EDGAR in an efficient way. I have ...