Data Extraction is the process of extracting data from a variety of sources for further analysis. A Data Extractor is someone who helps businesses and organizations gain insight from their data and create descriptive and predictive models. They specialize in finding patterns and relationships that guide decisions and uncover meaningful information. Through carefully crafted queries and processes, our Data Extractors can transform raw data into a useful format that can be used for reporting, analytics, machine learning and more.

Here's some projects that our expert Data Extractors made real:

  • Verifying shopper loyalty in Hong Kong, Netherland and Bulgaria
  • Gathering product pricing information in Tokyo
  • Conversion of text files into Excel documents or databases
  • Assisting with the extraction of website data
  • Accurately replicating an SQL Database
  • Organizing online booking forms
  • Increasing visibility with link optimization

When you partner with an experienced team of Freelancer's Data Extractors you can access valuable insights from your data that can guide decisions, uncover opportunities and create predictive models with new data sources. Our experts can help you unlock deeper insights with advanced filtering methods and complex coding. Explore the full range of possibilities with our talented community of professionals, capable of delivering comprehensive solutions tailored to your needs.

Ready to launch your very own project on Freelancer.com? We invite you to try us out and hire our experienced Data Extractors to make your design goals a reality. Let their creativity, skill, and proficiency bring something special to your project!

Conform celor 143,314 recenzii, clienții îi evaluează pe Data Extractors cu 4.9 din 5 stele.
Angajează Data Extractors

Data Extraction is the process of extracting data from a variety of sources for further analysis. A Data Extractor is someone who helps businesses and organizations gain insight from their data and create descriptive and predictive models. They specialize in finding patterns and relationships that guide decisions and uncover meaningful information. Through carefully crafted queries and processes, our Data Extractors can transform raw data into a useful format that can be used for reporting, analytics, machine learning and more.

Here's some projects that our expert Data Extractors made real:

  • Verifying shopper loyalty in Hong Kong, Netherland and Bulgaria
  • Gathering product pricing information in Tokyo
  • Conversion of text files into Excel documents or databases
  • Assisting with the extraction of website data
  • Accurately replicating an SQL Database
  • Organizing online booking forms
  • Increasing visibility with link optimization

When you partner with an experienced team of Freelancer's Data Extractors you can access valuable insights from your data that can guide decisions, uncover opportunities and create predictive models with new data sources. Our experts can help you unlock deeper insights with advanced filtering methods and complex coding. Explore the full range of possibilities with our talented community of professionals, capable of delivering comprehensive solutions tailored to your needs.

Ready to launch your very own project on Freelancer.com? We invite you to try us out and hire our experienced Data Extractors to make your design goals a reality. Let their creativity, skill, and proficiency bring something special to your project!

Conform celor 143,314 recenzii, clienții îi evaluează pe Data Extractors cu 4.9 din 5 stele.
Angajează Data Extractors

Filtrare

Căutările mele recente
Filtrează în funcție de:
Buget
la
la
la
Tip
Aptitudini
Limbi
    Starea proiectului
    37 proiecte găsite
    Recover Cached Wallet Seed Image -- 2
    6 zile left
    Cont confirmat

    I recently generated a 24-word recovery phrase for a crypto wallet while using Brave on my MacBook (macOS Ventura). I am now trying to retrieve that phrase—or at least the image that Brave displayed—by digging through the local browser cache. The computer is in my possession, and it has been turned off and on since the session (approx June 2025). I also have a Time Machine backup from around that time too. What I need from you is straightforward: locate and extract any image files Brave may have stored of that page during the session. A recovered PNG, JPG, or WebP of the phrase is ideal, but if you uncover the text itself in a different format, that’s a welcome bonus. You can follow the exact steps I did to create a phrase yourself as a real life test case and use your...

    $115 Average bid
    $115 Oferta medie
    5 oferte

    I have a set of existing spreadsheets filled with plain text records that now need to live in clean, well-structured CSV files. Your task is to pull every row from those sheets, check that the headings stay consistent, and export the results as UTF-8 CSV without introducing any hidden characters or broken line breaks. The spreadsheets are already organised, so no deep data cleansing is required—just tidy up obvious spacing issues, preserve punctuation, and make sure every cell ends up in the correct column position in the final CSV. I will share the sheets in either Google Sheets or Excel; work in whichever environment you prefer as long as the finished deliverable is a set of ready-to-import CSV files. Deliverables • One CSV per source sheet, correctly named and enc...

    $478 Average bid
    $478 Oferta medie
    15 oferte

    I need an experienced Python/Selenium developer for a web scraping project targeting TripAdvisor. The goal is to extract hotel reviews at scale and deliver clean, structured data in CSV/Excel. Fields required per hotel: - Hotel Name - Reviewer Name - Review Rating - Review Title - Review Body - Date of Stay Technical requirements: - Handle pagination automatically (Next button) - 500+ reviews per hotel URL - Crash-safe incremental saving - Anti-detection measures (random delays, rotating user agents) TO APPLY you must complete a free sample test: Scrape 50 reviews from one hotel URL I provide and send the CSV. No sample = no response. Budget: $200 fixed Timeline: 5 days

    $145 Average bid
    $145 Oferta medie
    43 oferte
    Targeted Email/contact Web Scrape -- 2
    6 zile left
    Cont confirmat

    Im looking for a large amount of lists and looking to pay around $1 per 20 contacts with valid email addresses harvested from lists I'll give you. I only neeed proper clean verified and contactable info! Objective: outreach to sell books. You might use the stack you are most comfortable with Python + BeautifulSoup, Scrapy, Selenium, Node.js + Puppeteer, or similar 'as long as the final data is clean and deduplicated' Deliverables: • A CSV file containing each email address alongside the exact page URL where it was found • A brief note on the toolchain or script used (for reproducibility) I need the lists to be well-targeted, deduplicated, and usable for outreach. Accuracy matters. Do Not give me a bloated list full of bounces... Are the contacts manually c...

    $97 Average bid
    $97 Oferta medie
    71 oferte

    **Title:** QCOW2 Disk Recovery Expert Needed (750GB VM Image – Data Recovery) **Description:** I am looking for an experienced Linux/KVM expert who can help recover data from a QCOW2 disk image. **Details:** * File format: QCOW2 * Virtual disk size: 750 GB * Actual file size: ~75 GB * Issue: Partition table is missing/corrupted * Current status: * Disk shows as "data" * No partitions detected * Tried basic tools (fdisk, testdisk, photorec) without success **Objective:** Recover important data (especially database files and system data) from the QCOW2 image. **Expected Skills:** * Strong experience with QCOW2, KVM, QEMU * Disk recovery expertise (LVM, ext4, XFS, partition recovery) * Familiar with tools like: * qemu-nbd * guestmount / libguestfs * testdis...

    $112 Average bid
    $112 Oferta medie
    8 oferte

    Title: Playwright Expert Needed for Complex JS UI Scraping (Nested Modals, Scroll Containers) --- Description: I need an experienced developer to complete and stabilize an existing Playwright (Python) scraping system for a complex web app (React-like UI). This is NOT a simple scraping task. The system is partially built and must be fixed, not rebuilt from scratch. --- Scope of Work: The scraper must extract ~62,000 products with the following data: 1. Product Specifications * Open product modal * Extract full specification table * Table may be internally scrollable (must handle both scrollable and non-scrollable cases) 2. My Last Purchase Price * Handle 3 cases: a) NA → no further actions b) Single price → click icon → extract purchase order ...

    $69 Average bid
    $69 Oferta medie
    33 oferte
    Data entry
    4 zile left

    I need support transferring text-only information from stacks of physical documents into a clean, well-structured Excel workbook. The sheets are already laid out; your role is to key every word exactly as shown on the page, keep the columns consistent, and flag anything that is unclear or illegible so I can double-check the source. Accuracy matters more than speed, and all material is confidential company data, so you must be comfortable following a simple non-disclosure step before beginning. I will provide scanned PDFs of the paper records, an Excel template, and brief field definitions so you know what goes where. Deliverables • Completed Excel file containing every record from the supplied documents • A short note highlighting any ambiguous or missing fields you encou...

    $17 Average bid
    $17 Oferta medie
    16 oferte

    recuperación forense de datos Android (WhatsApp) Busco un profesional con experiencia real en recuperación de datos a bajo nivel para un caso técnico específico en Android. Este proyecto requiere conocimientos avanzados, no soluciones básicas. Descripción del problema Se necesita recuperar un archivo de copia de seguridad local de WhatsApp ("") que fue eliminado del almacenamiento interno del dispositivo. - No disponible en papelera - Sin copia en Google Drive - Probable sistema de archivos: EXT4 o F2FS Alcance del trabajo - Análisis de almacenamiento a nivel de bloque - Extracción de datos desde sectores libres (file carving) - Identificación y reconstrucción de fragmentos de bases de datos SQLite - Posib...

    $435 Average bid
    $435 Oferta medie
    20 oferte
    Chinese PySpark Futures Notebook
    4 zile left
    Cont confirmat

    I need a Microsoft Fabric notebook written in PySpark that can call a suitable commodities-exchange API and pull the most recent futures data on a couple select futures products. The solution should be fully runnable inside Fabric. Key points I have set: • Refresh cadence: once a month, so include a simple scheduling example (Fabric pipeline or a cron-style note is fine). This data will be pulled from a Chinese exchange, so I want a Chinese speaking freelancer only!! Deliverables 1. The .ipynb (or .notebook) file ready to import into Fabric 2. A quick test run showing one successful fetch and a tidy DataFrame with the fields timestamp, contract, price, and volume Acceptance will be based on the notebook executing end-to-end without manual edits (apart from entering an API ke...

    $31 Average bid
    $31 Oferta medie
    17 oferte

    I have a small, well-scoped task centred on improving the existing backend logic of my application. The codebase is stable, but some routines are slower and more convoluted than they need to be, which is starting to bottleneck new feature development. What I need from you is a clean refactor of the relevant modules so they run more efficiently, are easier to read, and remain fully compatible with the rest of the system. I will provide the current repository, a short architectural walkthrough, and unit tests that must keep passing after your changes. Acceptance criteria: • Functionality and API surface remain unchanged. • All existing automated tests pass; new tests cover refactored paths. • Measurable improvement in execution time for the target routines (I&rsquo...

    $3 / hr Average bid
    $3 / hr Oferta medie
    5 oferte

    I run our purchasing and accounts-payable flow entirely in Odoo, yet the three-way match between Goods Receipt (GRN), supplier invoices, and purchase orders is still a manual chore. What I now need is an Odoo-native solution that automatically reconciles invoices against their corresponding POs (PDF format) and clearly flags quantity or price variances before payment is approved. Scope of work • Build or configure an Odoo module/automation that ingests PDF invoices and POs, extracts the relevant line-item data, and matches it to existing PO records. • Where a GRN already exists the tool should reference it; if not, it should still reconcile invoice vs PO and highlight any missing receipts. • Variance thresholds, approval flows, and exception reports must be configur...

    $212 Average bid
    $212 Oferta medie
    36 oferte

    there are total 235 questions. Need to extract only questions in word doc. No need to type answer or answer keys. I have sample format that must be followed

    $179 Average bid
    $179 Oferta medie
    17 oferte

    I have a batch of PDFs filled with plain text that I need transcribed into a well-structured Excel workbook. Every word must be captured exactly as it appears, with consistent spelling, punctuation, and capitalisation. I will supply the PDFs and a blank template showing the required column layout; your task is to transfer the content, keep each record on its own row, and flag any illegible sections for my review in a separate sheet. Accuracy is my top priority, so I’ll be double-checking against the source files. Please let me know how quickly you can start, your estimated turnaround for roughly 200 pages, and any previous experience you have converting PDF text into Excel. I’m ready to award as soon as I find the right match and look forward to working together.

    $16 / hr Average bid
    $16 / hr Oferta medie
    11 oferte
    Automated News Summary Podcast
    3 zile left
    Cont confirmat

    I need a solution to automate the generation of a summary podcast from news articles, publications, and substacks. The output should be an mp4 file. Requirements: - Scrape a provided set of URLs containing news articles. - Create an API that automates the generation of summary podcasts. - Summaries should be in a conversational tone. - The podcast would be created every monday morning based on news published Ideal Skills: - Experience in web scraping - API development skills - Background in content summarization - Familiarity with podcast formatting and MP4 generation Looking forward to your bids!

    $456 Average bid
    $456 Oferta medie
    62 oferte

    I have a collection of mixed text-and-number records that must be transferred with precision into MS Excel and Word. The priority is fast, accurate data entry: every value needs to land in the right column, table or paragraph, free of typos and aligned with the existing template. Along the way, I expect you to catch obvious inconsistencies—duplicate IDs, misplaced decimal points, stray characters—and correct them so the sheets and documents are analysis-ready the moment you hand them back. Familiarity with Excel functions such as data validation, conditional formatting and simple lookups will help you flag issues quickly; in Word you should know basic styles and table tools to keep everything tidy. Deliverables • A completed Excel workbook and matching Word file with a...

    $2 / hr Average bid
    $2 / hr Oferta medie
    16 oferte

    I have a set of figures that need to be copied accurately from various online sources into my spreadsheet. Every entry you make will be strictly numerical—no text fields to worry about—so the focus is on precision, speed, and consistency. You will receive clear links to the pages where the numbers appear. Most of them are straightforward web pages, but you may occasionally pull figures from emails or social platforms if they contain the statistics I need. As long as the data ends up in the correct columns and matches the original value, the job is a success. I’ll share a Google Sheet with the exact column structure and a few sample rows so you can mirror the required formatting. Please double-check each entry for typos or misplaced decimals before marking a row complete...

    $9 / hr Average bid
    $9 / hr Oferta medie
    15 oferte

    I have a batch of PDF documents that need to be transcribed into an Excel workbook with complete accuracy. Every value, label, and figure that appears in the source files must be captured exactly as written and placed in the correct columns or rows so the spreadsheet is ready for immediate analysis on my end. Key points for success • Absolute accuracy—no typos, omissions, or swapped numbers. • Consistent structure—each PDF should map to the same column order so I can sort and filter with ease. • Timely delivery—please let me know how quickly you can turn the files around and keep me updated on progress. I will supply the PDFs and a template Excel file (or you may build the template yourself after reviewing the first few documents, whichever is faste...

    $2 / hr Average bid
    $2 / hr Oferta medie
    15 oferte

    there are total 235 questions. Need to extract only questions in word doc. No need to type answer or answer keys. I have sample format that must be followed

    $11 Average bid
    $11 Oferta medie
    17 oferte

    I need a Microsoft Power Automate Desktop (PAD) flow designed specifically for daily bank reconciliation tasks. The workflow will primarily interact with an online banking portal to pull bank statement data. Additionally, it will need to integrate with internal sheets to support the reconciliation process. The primary goal is to create an efficient, automated system that can handle routine reconciliation tasks by connecting these data sources seamlessly, extracting the relevant information, and flagging discrepancies where necessary. If you have expertise in building PAD flows and working with similar setups involving online portals and internal spreadsheets, I’d love to collaborate with you on this project. The final deliverable should be a fully functional PAD flow ready for depl...

    $10 Average bid
    $10 Oferta medie
    1 oferte

    I need an experienced engineer who has already worked inside Oracle Agile PLM (E6) to pull out Product data and Document data cleanly and reliably so that I can load it into Siemens Teamcenter. The engagement is entirely about safeguarding data integrity—I am willing to trade speed for absolute correctness. The work involves: • Understanding the E6 database schema, metadata tables, and file vault structure • Writing repeatable PL/SQL / SQL Plus queries or custom Java-based utilities that can scale to the full production volume • Exporting BOMs, items, revisions, attributes, attachments, and related files in a format Teamcenter import tools will accept (XML, CSV, JT, bulk-load packages, etc.) • Logging, error handling, and reconciliation reports that prove...

    $9 / hr Average bid
    $9 / hr Oferta medie
    5 oferte

    Necesito las transcripciones completas de todos los videos (lives) de dos canales de YouTube. Qué espero: • Un archivo de texto (.txt) por cada video con el diálogo íntegro en inglés, sin descargar ni entregar los videos. • El texto debe conservar el orden y la puntuación básica para que sea fácil de leer. • Prefiero que indiques el título del video, fecha y hora de publicacion, y su enlace al inicio de cada archivo para referencia rápida. Yo proporcionaré los enlaces de ambos canales apenas iniciemos. Aporta tu método habitual de extracción (por ejemplo, YouTube API o scripts propios) y un plazo estimado para completar todo el lote. Busco precisión y entrega organizada; s...

    $39 Average bid
    $39 Oferta medie
    10 oferte

    I need a reliable web-scraping bot that automatically pulls fresh content from a specific news site on a schedule I can adjust. At minimum the script should capture the headline and full article text; if author name, publication date, and embedded image URLs can be extracted too, that’s a welcomed bonus. Build it in Python using a well-supported stack such as Requests/BeautifulSoup, Scrapy, or Selenium—whatever you feel is most robust for handling pagination and occasional layout changes. The bot should: • Navigate through the latest articles section (and subsequent pages if present) • Respect and reasonable rate-limits • Output clean, de-duplicated data to CSV or JSON and optionally push to a simple SQLite file For acceptance I’ll run the scrip...

    $13 / hr Average bid
    $13 / hr Oferta medie
    41 oferte

    I have several PDF files that contain a mix of text and numbers. I need every piece of information transferred with perfect accuracy into a structured, editable format—most likely an Excel workbook or Google Sheet, but I’m open to suggestions if you have a more efficient workflow. Your job will be to: • Open each PDF and extract every data point exactly as it appears • Preserve the distinction between text fields and numeric values so formulas can run without additional cleaning • Double-check totals, dates, and any other critical figures for consistency I will supply the PDFs and a simple template for the target file before we start. Please keep communication clear as you work; a quick note if something looks ambiguous is far better than a silent assumpt...

    $9 / hr Average bid
    $9 / hr Oferta medie
    19 oferte
    Refinitiv Workspace Data Extraction
    2 zile left
    Cont confirmat

    Please follow these steps exactly and send me two separate Excel files when done. FILE 1 — SMEs with ESG, Carbon, Energy and Financial Data Step 1 — Open Equity Screener — Open Refinitiv Workspace — Click Screener from the top menu — Select Equity Screener Step 2 — Country Filter Select all emerging market countries using the MSCI Emerging Markets group. If MSCI group is not available, select these countries manually: China, India, Brazil, South Africa, Mexico, Indonesia, Turkey, Saudi Arabia, UAE, Qatar, Kuwait, Bahrain, Oman, Egypt, Malaysia, Thailand, Philippines, Vietnam, Pakistan, Bangladesh, Nigeria, Kenya, Ghana, Morocco, Tunisia, Jordan, Chile, Colombia, Peru, Argentina, Romania, Hungary, Poland, Greece Step 3 — Size Filter &mda...

    $52 Average bid
    $52 Oferta medie
    15 oferte
    extract only questions from images
    2 zile left
    Cont confirmat

    there are total 235 questions. Need to extract only questions in word doc. No need to type answer or answer keys. I have sample format that must be followed

    $2 / hr Average bid
    $2 / hr Oferta medie
    50 oferte

    I have a batch of PDF documents that need to be transcribed into an Excel workbook with complete accuracy. Every value, label, and figure that appears in the source files must be captured exactly as written and placed in the correct columns or rows so the spreadsheet is ready for immediate analysis on my end. Key points for success • Absolute accuracy—no typos, omissions, or swapped numbers. • Consistent structure—each PDF should map to the same column order so I can sort and filter with ease. • Timely delivery—please let me know how quickly you can turn the files around and keep me updated on progress. I will supply the PDFs and a template Excel file (or you may build the template yourself after reviewing the first few documents, whichever is faste...

    $384 Average bid
    $384 Oferta medie
    59 oferte
    Daily Walmart.ca Product Scraper
    2 zile left
    Cont confirmat

    I need an automated script that visits under a specific store location (set by postal code) and captures every product available across all categories. The scraper has to run once every 24 hours, overwrite data in a structured file (CSV or JSON—your call), and handle the usual road-blocks on large sites such as store-selection prompts, pagination, lazy-loaded content, captchas, and IP throttling. What I expect from you • A clean, well-commented Python solution—Scrapy, Selenium, Playwright or a comparable stack are all fine as long as it is headless and low-maintenance. • A simple config or .env so I can change the target postal code or output path without touching the code. • A scheduler (cron job, Windows Task Scheduler, or a lightweight cloud function)...

    $26 Average bid
    $26 Oferta medie
    15 oferte

    The goal is to turn a collection of Law360 (LexisNexis) PDF articles into a clean, tabular dataset that I can open in Excel or any CSV-compatible tool. From each PDF I need the following fields captured: • News date • Filing date • Court • Plaintiff (own column) • Defendant (own column) Accuracy matters: plaintiff and defendant names must sit in separate columns just as selected. Use any reliable text-parsing approach—Python with pdfminer, PyPDF2, Tika, Regex, or an NLP library—so long as the script handles typical Law360 layouts and can be rerun on future batches. Please return: 1. The compiled .csv or .xlsx file. 2. The extraction script with brief instructions so I can reproduce or extend the process. 3. A short report of any PDFs that...

    $432 Average bid
    $432 Oferta medie
    64 oferte

    My Asus X1505Z, fitted with an M.2 SSD, is locked by BitLocker after a former employee enabled encryption and left without sharing the password or recovery key. I must regain full access to every file on the drive—there are no backups to fall back on—yet I would like the work carried out entirely through remote means. I do not have the PIN nor the recovery key for the laptop. The device was not a part of active directory and the person used his own Microsoft account. To my knowledge it is locked with PIN and TPM. I can allow remote access via Anydesk as i can boot it to Hiren or Medicat via usb. I am willing to obtain any required software in support of this recovery within reason (not more than the cost of the freelancer fee) If you have proven experience in Windows Bi...

    $344 Average bid
    $344 Oferta medie
    24 oferte

    لدي حوالى ١٥٠٠ منتج أرغب في نقلها من موقع إلكتروني واحد إلى ملف ‎Excel متوافق مع منصة ‎سلة. المطلوب هو استخراج جميع البيانات الأساسية لكل منتج—الاسم، السعر، الصور، الوصف، الفئة، ورقم المنتج—مع الالتزام بنوع المنتج المحدد فقط، ثم ترتيبها في الأعمدة التي تعتمدها ‎سلة لعملية الاستيراد. ما أحتاجه بالتحديد: • ملف ‎Excel منسّق جاهز للرفع على ‎سلة (ورقة واحدة تكفي). • مجلد صور منظّم، أو روابط صور مباشرة صالحة، بحيث يتعرّف عليها نموذج الاستيراد في ‎سلة. • دقّة كاملة في نقل الأسعار والعناوين العربية، وعدم ترك حقولٍ فارغة. • تسليم العمل في أقرب وقت ممكن؛ السرعة مهمة لكن دون المساس بالجودة. سأوفّر لك رابط الموقع والمعلومات الخاصة بنوع المنتج المطلوب، وأتوقع معاينة عيّنة صغيرة قبل تسليم الملف النهائي لاعتماد الصيغة.

    $59 Average bid
    $59 Oferta medie
    22 oferte
    WhatsApp Message Extraction to CSV
    1 zi left
    Cont confirmat

    I need a small, reliable utility that connects to the official WhatsApp Business API, pulls the full text history from both group and individual chats, and saves every message (plus date, time, sender, and chat ID) into a clean CSV file. Key points • Scope is message extraction only—no sending, no marketing logic. • Both group and one-to-one threads must be handled in the same run. • Output must be a single CSV per execution, ready for import to spreadsheets or a data warehouse. The solution can be written in Python, Node.js, or any language you are confident will compile on a standard Linux server. Please include clear setup instructions and a brief README so I can change credentials and select date ranges myself. Acceptance criteria – Connects ...

    $13 / hr Average bid
    $13 / hr Oferta medie
    78 oferte

    I need an Outlook email workflow that can pull data from CVs and email bodies to build a local database. The workflow should: - Extract the following data: - Contact information - Work experience - Education details - Email ID - Name - Date of birth - Source of data: Excel and MS Word documents Ideal skills and experience: - Proficiency in Microsoft Outlook and Excel - Experience with email automation tools - Strong data extraction and organization skills - Familiarity with MS Word document handling

    $75 Average bid
    $75 Oferta medie
    19 oferte

    I need a robust yet straightforward solution that pulls only the text content from several e-commerce websites. The target fields are product names, long and short descriptions, category labels, and any text-based specifications; I do not need images or pricing data. The scraper should: • Handle pagination, dynamic or lazy-loaded sections, and common anti-bot measures without overloading the servers. • Output clean, well-structured data (CSV or JSON preferred) ready for import into my internal system. • Be written in readable, well-commented code—Python with Scrapy, BeautifulSoup, or Selenium is ideal, but I’m open to equivalent approaches if they achieve the same reliability. • Include simple configuration so I can add or swap target domains later....

    $88 Average bid
    $88 Oferta medie
    47 oferte

    I have a set of PDFs that contain purely textual information presented in a mix of paragraphs, scattered labels, and occasional row-style groupings. I need every single data point moved into a clean Excel workbook so nothing is lost in translation. Because the layout switches between narrative blocks and ad-hoc table-like sections, the job will likely involve both automated capture (Python, Power Query, or your preferred OCR tool) and some careful manual cleanup to preserve order and context. Deliverable: one well-structured .xlsx file per source PDF, with all columns and data fields represented exactly as they appear, ready for filtering and analysis. Consistent header naming and no stray line breaks or merged cells, please. If you’ve handled mixed-format PDFs before and can turn ...

    $23 Average bid
    $23 Oferta medie
    21 oferte

    I have a collection of mixed-content PDFs—some pages are pure scans, the rest contain searchable text. Each file holds key tables I need converted into clean, structured data and then visualised as clear bar charts for quick comparison. You will: • Isolate every table across the PDFs, using OCR where the page is only an image. • Clean and normalise the numbers so columns and units stay consistent. • Produce bar charts that faithfully reflect the extracted figures (one chart per table unless otherwise noted). Deliverables 1. A single, tidy dataset (CSV or Excel) with each table clearly identified. 2. High-resolution bar chart images (PNG or SVG) and the source file (Excel, Power BI, or Python notebook) so I can regenerate them later. 3. A short note outlinin...

    $62 Average bid
    $62 Oferta medie
    15 oferte

    Мне требуется автоматизировано выгрузить данные о зарегистрированных компаниях с биржи грузоперевозок через её официальное API (лицензия на доступ будет оплачена) и аккуратно свести их в одну рабочую книгу Excel. Сначала из ATI нужно получить: - наименование компании, - страну регистрации, - статус в системе (перевозчик / грузовладелец / экспедитор), - контакты всех сотрудников (Ф. И. О., телефон, e-mail), - рейтинг надёжности, - количество сотрудников, - корпоративный сайт. Далее, пройдя по каждому корпоративному сайту, в эту же таблицу добавляются: - парк подвижного состава с указанием типа, - профиль перевозимых грузов, - регионы деятельности, - краткое описание деятельности компании. Ключевым для меня остаётся корректное заполнение имени и контактов кажд...

    $21 / hr Average bid
    $21 / hr Oferta medie
    63 oferte

    Articole recomandate doar pentru tine