SOL-e - Automated Reference Extractor (Patented)
A patented tool that leverages web scraping to automate the extraction of BibTeX references from SBC Open Lib (SOL), enabling bulk processing of hundreds of articles simultaneously.

I developed SOL-e to automate the collection of bibliographic references from the SBC Open Lib (SOL). Before this tool, researchers had to manually copy data to generate BibTeX files; now, the system handles batches of over 200 articles in seconds.
Innovation & Patent
Due to its impact on academic productivity, the tool was officially patented. It serves as a dedicated extraction engine focused on the integrity of scientific metadata.
Technical Implementation
The system architecture was designed for robustness and high efficiency:
- Angular Frontend: An intuitive interface for uploading article lists and monitoring extraction progress in real-time.
- Express.js API: A backend processing engine that orchestrates scraping tasks and data transformation.
- Advanced Web Scraping: Reliable navigation and extraction logic tailored to the dynamic structure of the SBC digital library.
- Reference Formatter: An algorithm that normalizes extracted data directly into the BibTeX standard, ready for LaTeX editors.
Impact & Results
The tool drastically accelerated the referencing phase of academic research, eliminating human error and manual fatigue. Scholars can now extract hundreds of citations in seconds, allowing them to focus their valuable time on critical analysis and scientific writing.
SOL-e highlights the use of automation to accelerate academic workflows, turning repetitive manual tasks into efficient digital processes.
