Fixed issues and Implementations#2
Open
Bamdad-Mehrvarzan wants to merge 1 commit into
Open
Conversation
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Add this suggestion to a batch that can be applied as a single commit.This suggestion is invalid because no changes were made to the code.Suggestions cannot be applied while the pull request is closed.Suggestions cannot be applied while viewing a subset of changes.Only one suggestion per line can be applied in a batch.Add this suggestion to a batch that can be applied as a single commit.Applying suggestions on deleted lines is not supported.You must change the existing code in this line in order to create a valid suggestion.Outdated suggestions cannot be applied.This suggestion has been applied or marked resolved.Suggestions cannot be applied from pending reviews.Suggestions cannot be applied on multi-line comments.Suggestions cannot be applied while the pull request is queued to merge.Suggestion cannot be applied right now. Please check back later.
OpenAlex ETL Pipeline Implementation & Dashboard Compatibility Patches
1. Architectural Overview (Object-Oriented Design)
We have successfully engineered a robust, production-ready ETL pipeline tailored for the OpenAlex API, mapped to the legacy Web of Science (WoS) target schema. The architecture strictly follows object-oriented principles:
convert2df_api), dynamically decoupling extraction, transformation, and validation layers.OpenAlexTransformerto seamlessly convert nested JSON responses (e.g., authorships, institutions, inverted abstracts index) into normalized WoS flat columns.safe_getmechanism, shielding the analytical engine from API data fragility and structural payload changes.2. Upstream Bug Fixes & Analytical Patches
In strict compliance with the Base Level requirements (Section 3), we reversed-engineered and hot-fixed critical bugs within the legacy analytical dashboard that caused interface crashes:
app.py): Fixed a hardcoded CSS syntax error wherestyle="width=100%;"was used instead ofstyle="width:100%;". This typo originally caused aValueErrorduring internal index unpacking inside theitableslibrary.app.py&cocmatrix.py): Patched the strict case-sensitive evaluation blocks. The dashboard loader automatically overwrites theDBflag to"ISI"in memory, which caused the strictif db == "Web_of_Science"condition to reject the dataset and throw a'NoneType' object has no attribute 'columns'error. Rewrote the conditional mapping to securely accommodate both"WEB_OF_SCIENCE"and"ISI"tokens.3. Verification & Validation Evidence
machine learningquery term.