WARNING: THIS SITE IS A MIRROR OF GITHUB.COM / IT CANNOT LOGIN OR REGISTER ACCOUNTS / THE CONTENTS ARE PROVIDED AS-IS / THIS SITE ASSUMES NO RESPONSIBILITY FOR ANY DISPLAYED CONTENT OR LINKS / IF YOU FOUND SOMETHING MAY NOT GOOD FOR EVERYONE, CONTACT ADMIN AT ilovescratch@foxmail.com
Skip to content

QUMIA/data-preprocessing

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

29 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

data-preprocessing

This repo contains a script that finds, processes and psuedonymizes the data.

Dependencies

dotenv
numpy
pandas
pyreadstat
pydicom
opencv-python
scipy

Configure

The script will look for a number of environment variables (ideally put them in a .env file in the project root):

QU_SALT         # Secret for id pseudonymization
QU_INPUT        # Input SPSS file
QU_OUTPUT       # Where to put the resulting csv file
QU_IMG_IN_DIR   # Where to find the image directories
QU_IMG_OUT_DIR  # Where to put the converted images

Run

Simply run python main.py

About

Some data preprocessing to prepare for machine learning

Resources

License

Stars

Watchers

Forks

Languages