I’m a multilingual linguist transitioning into Data Science & NLP.
With a background in education, communication, and text analysis, I bring human-centered thinking into machine learning and data work. I’m especially interested in corpus curation, NLP, ethical AI, and accessibility through language technologies.
I’m currently training in Data Science & AI at WBS Coding School (Germany) while building portfolio projects in Python, SQL, and Linguistic Data.
- 🐍 Python
- 📊 SQL
- ☕ Java (foundational)
- 🧩 Object-Oriented Programming (OOP)
- 📉 Data Cleaning, EDA & Feature Engineering
- 🔤 NLP Basics & Corpus Annotation
- 🧪 Unit Testing (JUnit)
- Python: Pandas, NumPy, Matplotlib, Seaborn
- Web/Data: BeautifulSoup, Requests, SQLAlchemy
- Visualization: Tableau, looker studio
- Backend: Django (basics)
- Cloud: Google Cloud (basics)
- Dev Tools: Git, Conda, Jupyter Notebook, VS Code, Eclipse
- Databases: SQL, MongoDB (basic)
- Windows, macOS
- TCP/IP (Introduction)
- Team collaboration
- Empathy & clear communication
- Analytical & structured thinking
- Problem solving
- Initiative & self-learning
Basic online banking tool featuring user registration, account management and transfers.
Data analysis of a Brazilian e-commerce dataset. Includes logistics evaluation, revenue insights, operational reliability and product value performance.
Two simple Java games: time conversion and digit sum calculator.
- 🕸️ Library manager (Python)(SQL): Preparing UI with streamlit
- 🔤 Corpus Review Contributions: Reviewing and correcting linguistic dataset entries on open-source platforms
- Data Science & AI (WBS Coding School, Germany)
- Full-Stack Software Development (IBM/Coursera)
- 🇧🇷 Portuguese – Native
- 🇬🇧 English – C1
- 🇩🇪 German – B2
- 🇪🇸 Spanish – C1
🎯 My goal is to help build responsible and inclusive language technologies that empower people through data and AI.