Extracting reports from a website based medical records system, refining the data, and consolidating it in a sheet
Hi, I am hoping someone can point me in the right direction. I know this is a loaded question, but even if you are able to help me understand what features I can use to make some of this happen, I'd really appreciate it! Thank you in advance!
I am fairly new at excel but have some experience with R and python for data analysis. I have thought of a bit of a workflow but I am happy to clarify more!
I need to do the following:
- From a website that hosts a cloud based medical charting system that is behind a username and password, I need to log in, navigate to a certain webpage, have certain buttons and items "clicked" to generate an excel report.
- Download that report. Let's call it "Report A1"
- In a similar fashion, download other reports. Let's call them "Report B1" and "Report C2".
- Open each report, do some data transformation and analysis, and put all the data from report A, B, and C it into a new sheet in a certain format.
I would love to know: What are the steps and systems I need to learn to make this happen? I know this might end up being a long project but I'm willing to learn and spend some time.
Problems I anticipate:
Potential problem #1: Logging in to actually get the data from the website that houses the electronic medical charting system. A while back I was semi successful in writing a python script with selenium if I remember correctly, to 'click' the right things and get report A, but it was tough and finicky. If this isn't possible I can manually download the reports and put them in a folder.
Potential problem #2: The name of the report will change but will always contain certain phrases, such as Report type B will always be named "Report B1" or "Report B2" or "Report B3" etc. So while name of the report may change I can perhaps stick them all in a folder and when importing 'filter' them based on a name.)
I know this is a loaded question, but please point me in the right direction! Thank you!
[link] [comments]
Want to read more?
Check out the full article on the original site