Showing 135 open source projects for "python web crawler"

View related business solutions
  • Simplify Purchasing For Your Business Icon
    Simplify Purchasing For Your Business

    Manage what you buy and how you buy it with Order.co, so you have control over your time and money spent.

    Simplify every aspect of buying for your business in Order.co. From sourcing products to scaling purchasing across locations to automating your AP and approvals workstreams, Order.co is the platform of choice for growing businesses.
    Learn More
  • Award-Winning Medical Office Software Designed for Your Specialty Icon
    Award-Winning Medical Office Software Designed for Your Specialty

    Succeed and scale your practice with cloud-based, data-backed, AI-powered healthcare software.

    RXNT is an ambulatory healthcare technology pioneer that empowers medical practices and healthcare organizations to succeed and scale through innovative, data-backed, AI-powered software.
    Learn More
  • 1
    Best-of Python

    Best-of Python

    A ranked list of awesome Python open-source libraries

    ...If you like to add or update projects, feel free to open an issue, submit a pull request, or directly edit the projects.yaml. Contributions are very welcome! Ranked list of awesome python libraries for web development. Correctly generate plurals, ordinals, indefinite articles; convert numbers. Libraries for loading, collecting, and extracting data from a variety of data sources and formats. Libraries for data batch- and stream-processing, workflow automation, job scheduling, and other data pipeline tasks.
    Downloads: 2 This Week
    Last Update:
    See Project
  • 2
    Web Dev for Beginners

    Web Dev for Beginners

    About 24 Lessons, 12 Weeks, Get Started as a Web Developer

    Web-Dev-For-Beginners is Microsoft’s open source, project-based curriculum for learning web development from scratch. Designed as a 12-week, 24-lesson course, it covers HTML, CSS, and JavaScript fundamentals through hands-on projects like terrariums, browser extensions, and space games. Each lesson includes a mix of pre-lecture quizzes, written content, assignments, challenges, and post-lecture quizzes to reinforce learning. The course also offers global accessibility with translations in...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 3
    Complete-Python-3-Bootcamp

    Complete-Python-3-Bootcamp

    Course Files for Complete Python 3 Bootcamp Course on Udemy

    ...In addition, it includes applied exercises in areas such as web scraping, working with APIs, and using Python libraries like NumPy, pandas, Matplotlib, and Seaborn for data analysis and visualization. Learners can progress from beginner-friendly basics to more advanced programming skills while reinforcing their knowledge with practice problems and projects. Because it mirrors the course content, this repository is widely used by students taking the Udemy course.
    Downloads: 4 This Week
    Last Update:
    See Project
  • 4
    Python Core 50 Courses

    Python Core 50 Courses

    Structured learning path that organizes Python fundamentals

    Python-Core-50-Courses is a structured learning path that organizes Python fundamentals into 50 digestible lessons designed for steady, incremental progress. The curriculum starts with the basics—syntax, variables, data types, and control flow—then advances to functions, modules, object-oriented programming, and common standard-library utilities.
    Downloads: 0 This Week
    Last Update:
    See Project
  • The full-stack observability platform that protects your dataLayer, tags and conversion data Icon
    The full-stack observability platform that protects your dataLayer, tags and conversion data

    Stop losing revenue to bad data today. and protect your marketing data with Code-Cube.io.

    Code-Cube.io detects issues instantly, alerts you in real time and helps you resolve them fast. No manual QA. No unreliable data. Just data you can trust and act on.
    Learn More
  • 5
    PythonPark

    PythonPark

    Python open source project "The Road to Self-Study Programming"

    PythonPark is a large, curated “learning playground” for Python — essentially a comprehensive self-study meta-repository aimed at helping learners progress in Python programming, data science, machine learning, web scraping, and software engineering practices. It aggregates tutorials, learning guides, project examples, and resources across topics: from Python basics and data structures to machine learning, web scraping, and even interview preparation and “programmer life” guidance. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 6
    freeCodeCamp

    freeCodeCamp

    freeCodeCamp.org's open-source codebase and curriculum

    freeCodeCamp is a nonprofit educational platform that offers a self-paced curriculum for learning web development, programming, data visualization, APIs, and algorithms. It features interactive coding challenges, real-world projects, and guided progress through topic modules, culminating in certificates for completed tracks. A key aspect is that students contribute to open-source projects for nonprofits or internal tooling as part of their learning, reinforcing both technical and...
    Downloads: 23 This Week
    Last Update:
    See Project
  • 7
    Archivematica

    Archivematica

    Free and open-source digital preservation system

    Archivematica is a web- and standards-based, open-source application which allows your institution to preserve long-term access to trustworthy, authentic, and reliable digital content. Our target users are archivists, librarians, and anyone working to preserve digital objects. You are free to copy, modify, and distribute Archivematica with attribution under the terms of the AGPLv3 license. Archivematica is an open-source application based on recognized standards that makes it possible to...
    Downloads: 2 This Week
    Last Update:
    See Project
  • 8
    Build your own X

    Build your own X

    Master programming by recreating your favorite technologies

    build-your-own-x is a massive, community-curated roadmap of hands-on tutorials that teach you to re-implement complex systems from scratch—things like databases, compilers, operating systems, interpreters, web servers, neural networks, regex engines, and more. Rather than offering abstract theory, it organizes step-by-step guides by topic and by programming language, so you can pick a project that fits your stack and skill level. The focus is on demystifying internals: you don’t just use a...
    Downloads: 1 This Week
    Last Update:
    See Project
  • 9
    CodeJo

    CodeJo

    Web-based introductory coding environment

    CodeJo is a web app that allows users to write Python code to control a web canvas element (a pixel grid) or another graphical display element. At this time, grid (web canvas) and fabric (a library that also uses a web canvas) are available, but new modules can be added to the system.
    Downloads: 0 This Week
    Last Update:
    See Project
  • Turn traffic into pipeline and prospects into customers Icon
    Turn traffic into pipeline and prospects into customers

    For account executives and sales engineers looking for a solution to manage their insights and sales data

    Docket is an AI-powered sales enablement platform designed to unify go-to-market (GTM) data through its proprietary Sales Knowledge Lake™ and activate it with intelligent AI agents. The platform helps marketing teams increase pipeline generation by 15% by engaging website visitors in human-like conversations and qualifying leads. For sales teams, Docket improves seller efficiency by 33% by providing instant product knowledge, retrieving collateral, and creating personalized documents. Built for GTM teams, Docket integrates with over 100 tools across the revenue tech stack and offers enterprise-grade security with SOC 2 Type II, GDPR, and ISO 27001 compliance. Customers report improved win rates, shorter sales cycles, and dramatically reduced response times. Docket’s scalable, accurate, and fast AI agents deliver reliable answers with confidence scores, empowering teams to close deals faster.
    Learn More
  • 10

    TOMUSS

    TOMUSS: The Online Multi User Simple Spreadsheet

    TOMUSS is an interactive web application (groupware) allowing multiple concurrent users to edit data tables. Its primary goal is the management of students grades.
    Downloads: 2 This Week
    Last Update:
    See Project
  • 11
    EulerSharp

    EulerSharp

    Euler Yet another proof Engine

    EYE [1] is a reasoning engine supporting the Semantic Web layers [2]. It performs controlled chaining and it supports Euler paths [3]. Via N3 [4] it is interoperable with Cwm [5]. [1] http://eulersharp.sourceforge.net/README [2] http://www.w3.org/DesignIssues/diagrams/sweb-stack/2006a [3] http://mathworld.wolfram.com/KoenigsbergBridgeProblem.html [4] http://www.w3.org/TeamSubmission/n3/ [5] http://www.w3.org/2000/10/swap/doc/cwm
    Downloads: 0 This Week
    Last Update:
    See Project
  • 12
    Django LMS

    Django LMS

    A learning management system using django web framework

    django-lms is an open-source Learning Management System (LMS) built with Django and designed for ease of use and extensibility. It allows administrators to manage courses, lessons, quizzes, and users in an educational environment. The project includes a clean UI and backend tools to help educators create and track learning content.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 13
    Project Based Learning

    Project Based Learning

    Curated list of project-based tutorials

    project-based-learning is a community-curated open source repository that compiles programming tutorials focused on building real-world applications from scratch. It organizes resources by programming languages such as Python, Java, JavaScript, C++, Go, Rust, and many others. Each tutorial emphasizes practical, hands-on learning through project development rather than theoretical study. The collection spans various domains including web development, game programming, systems programming, and machine learning. By following the projects, learners can strengthen problem-solving skills, gain experience with different technologies, and build portfolios. ...
    Downloads: 2 This Week
    Last Update:
    See Project
  • 14
    workshops_project

    workshops_project

    Workshops is an open source, simple, dead-lightweight LMS

    Workshops is an open source, simple, dead-lightweight LMS (Learning Management System) application programmed in Python (version 3.8.x) with Django (version 2.2.x) web framework which main purpose is to make a standarized way to share knowledge via courses in a slide-based view in browser powered by remark javascript library, easy to create, edit, delete and show your courses using simple markdown and html if necessary. Inspired on an old project in my social labours to help share knowledge in an easy way.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 15
    Realtime rear lighting led strip for PC More videos: https://youtu.be/KOzteiKo_Z4 https://youtu.be/8Y_ZgxOdfG0 https://youtu.be/9YNoitDR2JE WARNING: If you are not comfortable working with electronics, soldering and wires, you can mess up your computer, router, Raspberry PI or some people might even find a way to burn down the house! (I watch YouTube, I know you guys are out there...) This is not a beginner project, it involves multiple pieces of equipment and various expertise....
    Downloads: 0 This Week
    Last Update:
    See Project
  • 16
    Linkedin Skill assessments

    Linkedin Skill assessments

    Full reference of LinkedIn answers 2023 for skill assessments

    This repository is for those looking for answers to the LinkedIn assessment quiz questions or willing to help others by contributing to the tests. Or possibly you would like to create your first pull request and be added as a contributor. Add an explanation (or reference link) to your answers. That will help anyone to better learn concepts they might not have a great understanding of. Your resource for answers. In case you have doubts please contact this person or add them to review your PR.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 17
    Zero Install
    Zero Install is a decentralised cross-distribution software installation system. Create one package that works everywhere! With dependency handling and automatic updates, full support for shared libraries, and integration with native package managers
    Leader badge
    Downloads: 2,271 This Week
    Last Update:
    See Project
  • 18
    Python4Proteomics Course

    Python4Proteomics Course

    Python course for Proteomics analysis

    Python course (in Spanish) for Proteomics analysis using basically Jupyter NoteBooks. For more information, you can have a look at the readme.md file in the source code tree: https://sourceforge.net/p/lp-csic-uab/p4p/code/ci/default/tree/readme.md
    Downloads: 0 This Week
    Last Update:
    See Project
  • 19
    GoodByeCatpcha

    GoodByeCatpcha

    Solver ReCaptcha v2 Free

    An async Python library to automate solving ReCAPTCHA v2 by images/audio using Mozilla's DeepSpeech, PocketSphinx, Microsoft Azure’s, Google Speech and Amazon's Transcribe Speech-to-Text API. Also image recognition to detect the object suggested in the captcha. Built with Pyppeteer for Chrome automation framework and similarities to Puppeteer, PyDub for easily converting MP3 files into WAV, aiohttp for async minimalistic web-server, and Python’s built-in AsyncIO for convenience.
    Downloads: 4 This Week
    Last Update:
    See Project
  • 20
    Tux Math Scrabble

    Tux Math Scrabble

    12÷16×3=9÷4

    Latest version 0.9.1 runs on Python3. Date: June 15, 2023 Online/touch-screen version: https://www.asymptopia.com/tuxmathscrabble Encourages kids to construct compound equations and consider multiple abstract possibilities.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 21
    Web Security Dojo

    Web Security Dojo

    Virtual training environment to learn web app ethical hacking.

    Web Security Dojo is a virtual machine that provides the tools, targets, and documentation to learn and practice web application security testing. A preconfigured, stand-alone training environment ideal for classroom and conferences. No Internet required to use. Ideal for those interested in getting hands-on practice for ethical hacking, penetration testing, bug bounties, and capture the flag (CTF). A single OVA file will import into VirtualBox and VMware. There is also an Ansible...
    Leader badge
    Downloads: 63 This Week
    Last Update:
    See Project
  • 22

    Syllog

    Syllog is a system for teaching logic, with anonymous answer logging

    Syllog is a system for teaching various forms of logic, including syllogistic logic. It consists of a student-facing Java Applet that runs in the student's browser, and a teacher-facing, server-based logging facility for recording the answers given by students. The student-facing part simply poses questions, to which the student must provide an answer. The teacher-oriented logging facility then logs the answers anonymously for further investigation by the teacher.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 23

    Mathmaker

    Generator of maths worksheets and their detailed solutions

    This project aims to produce elementary math worksheets & answers automatically. It is meant to be used - by pupils to get as many sheets as they want - by teachers to re-use the produced sheets in their own work. Documentation for users is there: http://mathmaker.readthedocs.io/en/master/user_index.html
    Downloads: 0 This Week
    Last Update:
    See Project
  • 24
    TensorFlow World

    TensorFlow World

    Simple and ready-to-use tutorials for TensorFlow

    This repository aims to provide simple and ready-to-use tutorials for TensorFlow. The explanations are present in the wiki associated with this repository. There are different motivations for this open source project. TensorFlow (as we write this document) is one of / the best deep learning frameworks available. The question that should be asked is why has this repository been created when there are so many other tutorials about TensorFlow available on the web? Deep Learning is in very high...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 25
    mBlock

    mBlock

    Base on scratch offline v2.0 by MIT

    mBlock is a visual programming platform based on Scratch 3.0 that helps students learn programming concepts through drag-and-drop blocks, Python, and IoT/AI modules. Developed by Makeblock, it supports programming robots and Arduino boards with an intuitive interface. Designed for education, it bridges the gap between beginner-friendly environments and real-world coding with Python, AI, and robotics tools.
    Downloads: 65 This Week
    Last Update:
    See Project
MongoDB Logo MongoDB