Alternatives to tap

Compare tap alternatives for your business or organization using the curated list below. SourceForge ranks the best alternatives to tap in 2026. Compare features, ratings, user reviews, pricing, and more from tap competitors and alternatives in order to make an informed decision for your business.

  • 1
    Tenzir

    Tenzir

    Tenzir

    ​Tenzir is a data pipeline engine specifically designed for security teams, facilitating the collection, transformation, enrichment, and routing of security data throughout its lifecycle. It enables users to seamlessly gather data from various sources, parse unstructured data into structured formats, and transform it as needed. It optimizes data volume, reduces costs, and supports mapping to standardized schemas like OCSF, ASIM, and ECS. Tenzir ensures compliance through data anonymization features and enriches data by adding context from threats, assets, and vulnerabilities. It supports real-time detection and stores data efficiently in Parquet format within object storage systems. Users can rapidly search and materialize necessary data and reactivate at-rest data back into motion. Tension is built for flexibility, allowing deployment as code and integration into existing workflows, ultimately aiming to reduce SIEM costs and provide full control.
  • 2
    Upsolver

    Upsolver

    Upsolver

    Upsolver makes it incredibly simple to build a governed data lake and to manage, integrate and prepare streaming data for analysis. Define pipelines using only SQL on auto-generated schema-on-read. Easy visual IDE to accelerate building pipelines. Add Upserts and Deletes to data lake tables. Blend streaming and large-scale batch data. Automated schema evolution and reprocessing from previous state. Automatic orchestration of pipelines (no DAGs). Fully-managed execution at scale. Strong consistency guarantee over object storage. Near-zero maintenance overhead for analytics-ready data. Built-in hygiene for data lake tables including columnar formats, partitioning, compaction and vacuuming. 100,000 events per second (billions daily) at low cost. Continuous lock-free compaction to avoid “small files” problem. Parquet-based tables for fast queries.
  • 3
    Dropbase

    Dropbase

    Dropbase

    Centralize offline data, import files, process and clean up data. Export to a live database with 1 click. Streamline data workflows. Centralize offline data and make it accessible to your team. Bring offline files to Dropbase. Multiple formats. Any way you like. Process and format data. Add, edit, re-order, and delete processing steps. 1-click exports. Export to database, endpoints, or download code with 1 click. Instant REST API access. Query Dropbase data securely with REST API access keys. Onboard data where you need it. Combine and process datasets to fit the desired format or data model. No code. Process your data pipelines using a spreadsheet interface. Track every step. Flexible. Use a library of pre-built processing functions. Or write your own. 1-click exports. Export to database or generate endpoints with 1 click. Manage databases. Manage and databases and credentials.
    Starting Price: $19.97 per user per month
  • 4
    ATA

    ATA

    ATA

    ATA is an AI-powered API management platform that centralizes design, testing, governance, documentation, and lifecycle workflows into a single intelligent workspace to help teams design, build, test, maintain, and govern APIs with higher quality and collaboration. It keeps API code, design documentation, specifications, test cases, and release notes in sync, reducing manual effort and drift while supporting OpenAPI specs, mock servers for frontend progress without backend readiness, and scheduled API monitoring to detect slow responses, timeouts, or failures early. It includes a Developer Studio for design-first OpenAPI creation and version control, E2E Test Automation with AI-generated robustness and security tests, mock servers, chained API workflows, and UI automation testing, and a Documentation Portal that centralizes API docs with multi-editor support, version management, secure access control, and auto-linked live endpoints.
  • 5
    Gentoro

    Gentoro

    Gentoro

    Gentoro is a platform built to empower enterprises to adopt agentic automation by bridging AI agents with real-world systems securely and at scale. It uses the Model Context Protocol (MCP) as its foundation, allowing developers to automatically convert OpenAPI specs or backend endpoints into production-ready MCP Tools, without writing custom integration code. Gentoro takes care of runtime concerns like logging, retries, monitoring, and cost optimization, while enforcing secure access, auditability, and governance policies (e.g., OAuth support, policy enforcement) whether deployed in a private cloud or on-premises. It is model- and framework-agnostic, meaning it supports integration with various LLMs and agent architectures. Gentoro helps avoid vendor lock-in and simplifies tool orchestration in enterprise environments by managing tool generation, runtime, security, and maintenance in one stack.
  • 6
    GitCode

    GitCode

    GitCode

    GitCode is a global open source community and code-hosting platform that mirrors and aggregates repositories to provide deep, fast code exploration and seamless project collaboration in one unified interface. At its core is an intelligent code search engine that lets you query open source projects, models, datasets, issues, pull requests, users, and organizations, complete with keyword filtering by language, stars, forks, update time, highlighted result,s and customizable sorting to surface exactly what you need in seconds. Beyond search, GitCode offers online project browsing with automatic empty-directory folding, a Markdown editor with full emoji support, and both table and Kanban board views for issues and task management. The robust permission matrix lets teams define interdependent, role-based access controls while avoiding configuration errors, and the natural-language OpenAPI endpoint exposes repository metadata for integration into custom workflows.
  • 7
    API Agent
    API Agent in IBM API Connect is a watsonx.ai–powered assistant that automates core tasks across the entire API lifecycle via a natural‑language, conversational interface. Built on an agentic framework, it lets teams rapidly generate OpenAPI specifications, mocked responses, and rich documentation for design‑first projects, or connect to backend data sources, build application code, and auto‑deploy to Code Engine for code‑first workflows, all without manual setup. To combat API sprawl, API Agent intelligently searches your existing API catalog by simple description prompts, recommending reusable endpoints and reducing duplication. It enforces governance by validating specs against organizational rulesets, suggesting or applying fixes automatically, and boosts quality with a built‑in testing suite that generates and runs semantic test cases to catch issues early.
  • 8
    Crow

    Crow

    Crow

    Crow, the language user interface for modern software, is a developer-focused platform that makes it easy to embed a fully functional AI copilot directly into your application with minimal effort. Instead of building a chatbot or copilot from scratch, wiring backend endpoints, designing UI, managing user state, handling context, and enabling tool calls, Crow handles all of that for you. You simply add a small script to your frontend, and Crow does the rest; it connects to your backend endpoints, converts registered APIs (via OpenAPI specs or endpoint URLs) into callable tools, and manages authentication so that AI-driven actions respect your existing user permissions. To give the copilot real context, Crow lets you ingest website content, documentation, or arbitrary files so the AI can answer domain-specific questions accurately. Once configured, the copilot can not only respond conversationally, but also execute actions, for example, reading or writing data.
  • 9
    Scalar

    Scalar

    Scalar

    Scalar is a modern, open source API platform designed to help developers create, document, test, and manage APIs through a unified, interactive environment built around the OpenAPI standard. It transforms OpenAPI specifications into clean, visually appealing, and interactive API documentation that allows users to explore endpoints and test requests directly within the interface, combining documentation and a full-featured API client in one place. It includes a built-in REST API client that supports sending requests, inspecting responses, handling authentication methods such as API keys and OAuth2, and working with environment variables and dynamic parameters. Scalar also offers tools for generating SDKs, managing and versioning API specifications with Git integration, and creating documentation that stays synchronized with the underlying API through Markdown or MDX workflows.
    Starting Price: Free
  • 10
    Globessey Data Server

    Globessey Data Server

    Adaptive Recognition

    Globessey Data Server (GDS) is a centralized data server and middleware solution designed to collect, manage, and visualize extensive traffic data from various endpoints. It seamlessly integrates with Adaptive Recognition's ANPR/LPR cameras and third-party devices, enabling efficient data aggregation and analysis. Built on the robust ELK stack, GDS offers secure data storage, user-friendly dashboards, and advanced visualization tools, including heatmaps and geofencing filters. Its intuitive interface facilitates easy deployment and device registration, while the documented OpenAPI and SDK samples support flexible development. GDS is compatible with both Windows and Linux operating systems, making it a versatile tool for traffic management, smart city infrastructure, and security applications.
  • 11
    ArangoDB

    ArangoDB

    ArangoDB

    Natively store data for graph, document and search needs. Utilize feature-rich access with one query language. Map data natively to the database and access it with the best patterns for the job – traversals, joins, search, ranking, geospatial, aggregations – you name it. Polyglot persistence without the costs. Easily design, scale and adapt your architectures to changing needs and with much less effort. Combine the flexibility of JSON with semantic search and graph technology for next generation feature extraction even for large datasets.
  • 12
    Konfig

    Konfig

    Konfig

    Konfig is a developer tool that automates the generation of SDKs, documentation, demos, and tutorials for REST APIs, facilitating seamless onboarding for external developers. By importing an OpenAPI Specification or Postman Collection, Konfig automatically produces SDKs in popular programming languages, including TypeScript, Python, Java, C#, PHP, Ruby, Go, Swift, and Dart. The platform ensures high-quality SDKs by identifying and rectifying errors in the OpenAPI Specification through its linter and writing test cases to prevent API updates from breaking existing SDKs. Konfig also generates branded, user-friendly documentation that auto-updates with any changes to the API specification, maintaining consistency between documentation and SDKs. Additionally, it allows for the creation of engaging demos and tutorials using familiar Markdown, enabling users to run code in-browser for hands-on learning.
  • 13
    Sliq

    Sliq

    Sliq

    Sliq is an AI-powered data cleaning platform that transforms messy raw datasets into clean, analysis-ready data in minutes by automatically detecting and fixing common quality issues such as incorrect formats, missing values, schema inconsistencies, and formatting errors, so analysts and engineers spend less time on “janitor work” and more time on insights and modeling. It uses context-aware intelligence to understand the semantic domain of uploaded data (for example, whether it’s financial records, ecommerce logs, or medical data) and tailors a cleaning plan specifically for that dataset instead of applying one-size-fits-all rules. Users can upload files directly or integrate with workflows programmatically, and Sliq supports common data formats, including CSV, JSON, and Parquet, while seamlessly integrating into existing data ecosystems.
    Starting Price: $30
  • 14
    Apache DataFusion

    Apache DataFusion

    Apache Software Foundation

    Apache DataFusion is an extensible, high-performance query engine written in Rust that utilizes Apache Arrow as its in-memory format. Designed for developers building data-centric systems such as databases, data frames, machine learning, and streaming applications, DataFusion offers SQL and DataFrame APIs, a vectorized, multi-threaded, streaming execution engine, and support for partitioned data sources. It natively supports formats like CSV, Parquet, JSON, and Avro, and allows for seamless integration with object stores including AWS S3, Azure Blob Storage, and Google Cloud Storage. The engine features a comprehensive query planner, a state-of-the-art optimizer with capabilities like expression coercion and simplification, projection and filter pushdown, sort and distribution-aware optimizations, and automatic join reordering. DataFusion is highly customizable, enabling the addition of user-defined scalar, aggregate, and window functions, custom data sources, query languages, etc.
    Starting Price: Free
  • 15
    Mobula

    Mobula

    Mobula Labs

    Mobula provides curated datasets for builders: market data with Octopus, wallets data, metadata with Metacore, alongside with REST, GraphSQL & SQL interfaces to query them. You can get started playing around with the API endpoints for free, and sign-up to the API dashboard once you need API keys (queries without API keys aren’t production-ready). Get in touch with the team if you have questions, ideas, feedbacks or needs!
  • 16
    Axibase Time Series Database
    Parallel query engine with time- and symbol-indexed data access. Extended SQL syntax with advanced filtering and aggregations. Consolidate quotes, trades, snapshots, and reference data in one place. Strategy backtesting on high-frequency data. Quantitative and market microstructure research. Granular transaction cost analysis and rollup reporting. Market surveillance and anomaly detection. Non-transparent ETF/ETN decomposition. FAST, SBE, and proprietary protocols. Plain text protocol. Consolidated and direct feeds. Built-in latency monitoring tools. End-of-day archives. ETL from institutional and retail financial data platforms. Parallel SQL engine with syntax extensions. Advanced filtering by trading session, auction stage, index composition. Optimized aggregates for OHLCV and VWAP calculations. Interactive SQL console with auto-completion. API endpoint for programmatic integration. Scheduled SQL reporting with email, file, and web delivery. JDBC and ODBC drivers.
  • 17
    Google Cloud Endpoints
    Develop, deploy, protect, and monitor your APIs with Cloud Endpoints. An NGINX-based proxy and distributed architecture give unparalleled performance and scalability. Using an OpenAPI Specification or one of our API frameworks, Cloud Endpoints gives you the tools you need for every phase of API development and provides insight with Cloud Logging, Cloud Monitoring, and Cloud Trace. Control who has access to your API and validate every call with JSON Web Tokens and Google API keys. Integration with Auth0 and Firebase Authentication lets you identify the users of your web or mobile application. Extensible Service Proxy delivers security and insight in less than 1 ms per call. Deploy your API automatically with App Engine and Google Kubernetes Engine, or add our proxy container to your Kubernetes deployment. Monitor critical operations metrics in Google Cloud Console, and gain insight into your users and usage with Cloud Trace, Cloud Logging, and BigQuery.
  • 18
    42Crunch

    42Crunch

    42Crunch

    Your most valuable intelligence isn’t AI, it’s your developers. Empower them with tools to be the driving force behind API security – ensuring continuous, unparalleled protection across the entire API lifecycle. Push your OpenAPI definition to your CI/CD pipeline and automatically audit, scan and protect your API. Audit your OpenAPI / Swagger file against 300+ security vulnerabilities, we’ll rank them by severity level and tell you exactly how to fix them – making security a seamless part of your development lifecycle Enforce a zero-trust architecture by ensuring all your APIs meet a set security standard before production, scan the live API endpoints for potential vulnerabilities, and automate redeployment. Ensure security of all your APIs from design to deployment, get detailed insight about attacks on APIs in production – and protect against threats – without impacting performance.
  • 19
    create-api.dev
    Create-API.dev is an AI-powered OpenAPI specification builder by Kong that lets you generate high-quality API specs in seconds through a simple chat interface. By messaging the service, you provide your desired endpoints or rough outline, and an underlying Google LLM crafts complete, standards-compliant OpenAPI definitions that are ready to share, test, and ship. As a lightweight, web-based tool, it requires no installation. The generated specs can be exported in standard YAML or JSON formats for seamless integration with your existing API gateways and documentation pipelines. Create-API.dev operates under Google’s Generative AI Prohibited Use Policy and advises discretion before relying on or publishing any AI-generated content.
    Starting Price: Free
  • 20
    Datazoom

    Datazoom

    Datazoom

    Improving the experience, efficiency, and profitability of streaming video requires data. Datazoom enables video publishers to better operate distributed architectures through centralizing, standardizing, and integrating data in real-time to create a more powerful data pipeline and improve observability, adaptability, and optimization solutions. Datazoom is a video data platform that continually gathers data from endpoints, like a CDN or a video player, through an ecosystem of collectors. Once the data is gathered, it is normalized using standardized data definitions. This data is then sent through available connectors to analytics platforms like Google BigQuery, Google Analytics, and Splunk and can be visualized in tools such as Looker and Superset. Datazoom is your key to a more effective and efficient data pipeline. Get the data you need in real-time. Don’t wait for your data when you need to resolve an issue immediately.
  • 21
    Y42

    Y42

    Datos-Intelligence GmbH

    Y42 is the first fully managed Modern DataOps Cloud. It is purpose-built to help companies easily design production-ready data pipelines on top of their Google BigQuery or Snowflake cloud data warehouse. Y42 provides native integration of best-of-breed open-source data tools, comprehensive data governance, and better collaboration for data teams. With Y42, organizations enjoy increased accessibility to data and can make data-driven decisions quickly and efficiently.
  • 22
    Kiota

    Kiota

    Microsoft

    Kiota is a client, plugin, and manifest generator for HTTP REST APIs described by OpenAPI. Available as a command-line tool and a Visual Studio Code (VS Code) extension, Kiota enables developers to search for API descriptions, filter and select specific endpoints, and generate models and a chained method API surface in various programming languages. This approach eliminates the need to depend on different API clients for each API and allows for precise generation of the required API surface area. Additionally, Kiota facilitates participation in the Microsoft Copilot ecosystem by enabling the generation of API plugins. The VS Code extension enhances the Kiota experience with a rich user interface, supporting features such as searching for API descriptions, filtering endpoints, and generating API clients. Users can select desired endpoints and generate clients, plugins, or other outputs, with completion notifications and easy access to generated outputs within the workspace.
    Starting Price: Free
  • 23
    OpenObserve

    OpenObserve

    OpenObserve

    OpenObserve is an open source observability platform for logs, metrics, and traces that emphasizes high performance, scalability, and dramatically lower cost. It supports petabyte-scale observability thanks to features like data compression using columnar storage and the ability to use “bring your own bucket” storage (local disk, S3, GCS, Azure Blob, etc.). It is written in Rust, uses the DataFusion query engine to directly query Parquet files, and provides a stateless, horizontally scalable architecture with caching (both result and disk) to maintain speed under heavy load. It embraces open standards (OpenTelemetry compatibility, vendor-neutral APIs), so it fits into existing monitoring/logging workflows. Key modules include logs, metrics, traces, frontend monitoring, pipelines, alerts, and dashboards/visualizations.
    Starting Price: $0.30 per GB
  • 24
    RxDB

    RxDB

    RxDB

    ​RxDB is a local-first, NoSQL JavaScript database optimized for modern web and mobile applications. It enables offline-first functionality by storing data directly on the client using storage engines like IndexedDB, OPFS, SQLite, and more. RxDB offers real-time reactivity, allowing developers to subscribe to changes in documents, fields, or queries, ensuring that UI components update automatically as data changes. Its flexible replication engine supports syncing with various backends and custom endpoints. RxDB integrates seamlessly with frameworks and environments. Additional features include field-level encryption, schema validation, conflict resolution, backup and restore, attachments, and CRDT support. By reducing server load and providing low-latency local queries, RxDB enhances performance and scalability, making it ideal for applications that require real-time updates, offline access, and cross-platform consistency.
    Starting Price: Free
  • 25
    Oarkflow

    Oarkflow

    Oarkflow

    Automate your business pipeline with our flow builder. Use operations that matters to you. Bring your own service providers for email, sms and http services. Use our advanced query builder to query and analyze csv with any field numbers and rows. We store the csv files you've uploaded on our platform in a secured vault and account activity logs. We don't store any data records you request for processing.
    Starting Price: $0.0005 per task
  • 26
    Tad

    Tad

    Tad

    ​Tad is a free (MIT Licensed) desktop application for viewing and analyzing tabular data. It is a fast viewer for CSV and Parquet files and SQLite and DuckDb databases that support large files. It's a Pivot Table for analyzing and exploring data. Internally, Tad uses DuckDb for fast, accurate processing. Designed to fit into the workflow of data engineers and data scientists. Tad includes updates to DuckDb 1.0, the ability to export filtered tables as Parquet (as well as CSV), a fix for formatting numbers in scientific notation, and other minor bug fixes and dependent package upgrades. A packaged installer for Tad is available for macOS (x86 and Apple Silicon), Linux, and Windows.
    Starting Price: Free
  • 27
    StreamScape

    StreamScape

    StreamScape

    Make use of Reactive Programming on the back-end without the need for specialized languages or cumbersome frameworks. Triggers, Actors and Event Collections make it easy to build data pipelines and work with data streams using simple SQL-like syntax, shielding users from the complexities of distributed system development. Extensible Data Modeling is a key feature that supports rich semantics and schema definition for representing real-world things. On-the-fly validation and data shaping rules support a variey of formats like XML and JSON, allowing you to easily describe and evolve your schema, keeping pace with changing business requirements. If you can describe it, we can query it. Know SQL and Javascript? Then you already know how to use the data engine. Whatever the format, a powerful query language lets you instantly test logic expressions and functions, speeding up development and simplifying deployment for unmatched data agility.
  • 28
    Google Earth Engine
    Google Earth Engine is a cloud-based platform for scientific analysis and visualization of geospatial datasets, providing access to a vast public data archive that includes over 90 petabytes of analysis-ready satellite imagery and more than 1,000 curated geospatial datasets. This extensive catalog encompasses over 50 years of historical imagery, updated daily, with resolutions as fine as one meter per pixel, featuring datasets such as Landsat, MODIS, Sentinel, and the National Agriculture Imagery Program (NAIP). Earth Engine enables users to analyze Earth observation data and apply machine learning techniques through its web-based JavaScript Code Editor and Python API, facilitating the development of complex geospatial workflows. The platform's integration with Google Cloud allows for large-scale parallel processing, empowering users to conduct comprehensive analyses and visualize Earth data efficiently. Additionally, Earth Engine offers interoperability with BigQuery.
    Starting Price: $500 per month
  • 29
    One Auto API

    One Auto API

    One Auto API

    One Auto API is a unified automotive data platform that simplifies access to a broad range of vehicle and market information by consolidating multiple leading automotive data providers into a single, developer-friendly API. Rather than integrating separate data sources with different formats and contracts, users can query one endpoint to retrieve rich automotive datasets, including vehicle specifications, retail pricing, valuations, technical details, market metrics, registration lookups, tyre fitments, and history checks from partners like Auto Trader, Experian, Brego, DriveRightData, and others. It offers comprehensive SDKs in over 30 languages, sandbox access, and OpenAPI/Swagger documentation to help developers integrate data quickly into apps, websites, dealer portals, and software systems, reducing engineering overhead and accelerating product delivery.
  • 30
    Dataform

    Dataform

    Google

    Dataform enables data analysts and data engineers to develop and operationalize scalable data transformation pipelines in BigQuery using only SQL from a single, unified environment. Its open source core language lets teams define table schemas, configure dependencies, add column descriptions, and set up data quality assertions within a shared code repository while applying software development best practices, version control, environments, testing, and documentation. A fully managed, serverless orchestration layer automatically handles workflow dependencies, tracks lineage, and executes SQL pipelines on demand or via schedules in Cloud Composer, Workflows, BigQuery Studio, or third-party services. In the browser-based development interface, users get real-time error feedback, visualize dependency graphs, connect to GitHub or GitLab for commits and code reviews, and launch production-grade pipelines in minutes without leaving BigQuery Studio.
    Starting Price: Free
  • 31
    CharityBase

    CharityBase

    CharityBase

    CharityBase is a free, open source database and GraphQL API that aggregates fragmented data from the Charity Commission, Companies House, 360 Giving, charity websites, the ONS, and social media into a single, cleaned, normalized, and searchable dataset. It powers a public search portal where users can discover detailed profiles of UK charities, complete with finance, governance, and activity data, and offers a single GraphQL endpoint that responds in structured JSON with custom queries for counts, aggregates, and detailed lists. Designed to eliminate the heavy lifting of data collection and cleaning, CharityBase enables startups, grantmakers, and researchers to build digital tools, such as dashboards, reports, and grant-finding applications, without managing their own data pipeline. Its API supports both GET and POST requests, variable-driven queries, pagination, and live interactive playgrounds for rapid prototyping, all underpinned by regularly updated, audit-trail-backed records.
    Starting Price: Free
  • 32
    Apache Pinot

    Apache Pinot

    Apache Corporation

    Pinot is designed to answer OLAP queries with low latency on immutable data. Pluggable indexing technologies - Sorted Index, Bitmap Index, Inverted Index. Joins are currently not supported, but this problem can be overcome by using Trino or PrestoDB for querying. SQL like language that supports selection, aggregation, filtering, group by, order by, distinct queries on data. Consist of of both offline and real-time table. Use real-time table only to cover segments for which offline data may not be available yet. Detect the right anomalies by customizing anomaly detect flow and notification flow.
  • 33
    IBM Cloud SQL Query
    Serverless, interactive querying for analyzing data in IBM Cloud Object Storage. Query your data directly where it is stored, there's no ETL, no databases, and no infrastructure to manage. IBM Cloud SQL Query uses Apache Spark, an open-source, fast, extensible, in-memory data processing engine optimized for low latency and ad hoc analysis of data. No ETL or schema definition needed to enable SQL queries. Analyze data where it sits in IBM Cloud Object Storage using our query editor and REST API. Run as many queries as you need; with pay-per-query pricing, you pay only for the data scan. Compress or partition data to drive savings and performance. IBM Cloud SQL Query is highly available and executes queries using compute resources across multiple facilities. IBM Cloud SQL Query supports a variety of data formats such as CSV, JSON and Parquet, and allows for standard ANSI SQL.
    Starting Price: $5.00/Terabyte-Month
  • 34
    Tinybird

    Tinybird

    Tinybird

    Query and shape your data using Pipes, a new way to chain SQL queries inspired by Python Notebooks. Designed to reduce complexity without sacrificing performance. By splitting your query in different nodes you simplify development and maintenance. Activate your production-ready API endpoints with one click. Transformations occur on-the-fly so you will always work with the latest data. Share access securely to your data in one click and get fast and consistent results. Apart from providing monitoring tools, Tinybird scales linearly: don't worry about traffic spikes. Imagine if you could turn, in a matter of minutes, any Data Stream or CSV file into a fully secured real-time analytics API endpoint. We believe in high-frequency decision-making for all organizations in all industries including retail, manufacturing, telecommunications, government, advertising, entertainment, healthcare, and financial services.
    Starting Price: $0.07 per processed GB
  • 35
    AutoRest

    AutoRest

    Microsoft

    The AutoRest tool generates client libraries for accessing RESTful web services. Input to AutoRest is a spec that describes the REST API using the OpenAPI specification format and streamlines the creation of client code across multiple programming languages, including C#, Java, Python, TypeScript, and Go. This automation enhances consistency and efficiency in API consumption, reducing the manual effort required to develop and maintain client libraries. AutoRest operates through a flexible pipeline that processes OpenAPI input files, transforming them into a code model which is then utilized by language-specific generators to produce client code adhering to each language's design guidelines. The tool supports both OpenAPI 2.0 and 3.0 specifications, ensuring compatibility with a wide range of APIs. Developers can install AutoRest on Windows, macOS, or Linux systems, with installation facilitated via Node.js.
    Starting Price: Free
  • 36
    Apache Parquet

    Apache Parquet

    The Apache Software Foundation

    We created Parquet to make the advantages of compressed, efficient columnar data representation available to any project in the Hadoop ecosystem. Parquet is built from the ground up with complex nested data structures in mind, and uses the record shredding and assembly algorithm described in the Dremel paper. We believe this approach is superior to simple flattening of nested namespaces. Parquet is built to support very efficient compression and encoding schemes. Multiple projects have demonstrated the performance impact of applying the right compression and encoding scheme to the data. Parquet allows compression schemes to be specified on a per-column level, and is future-proofed to allow adding more encodings as they are invented and implemented. Parquet is built to be used by anyone. The Hadoop ecosystem is rich with data processing frameworks, and we are not interested in playing favorites.
  • 37
    OpenAPI Generator

    OpenAPI Generator

    OpenAPI Generator

    OpenAPI Generator is an open-source tool that automatically generates client libraries, server stubs, API documentation, and configuration files from an OpenAPI Specification (OAS) document. It supports a wide range of programming languages and frameworks, making it easier for developers to integrate APIs into their applications. By automating the creation of boilerplate code, OpenAPI Generator reduces development time and ensures consistency in API interactions. It allows teams to focus on implementing business logic rather than dealing with repetitive tasks like data serialization, deserialization, and HTTP request handling. The tool is widely used in API-driven development, enabling seamless integration of third-party services and simplifying the process of keeping API consumers and providers in sync.
    Starting Price: Free
  • 38
    BigLake

    BigLake

    Google

    BigLake is a storage engine that unifies data warehouses and lakes by enabling BigQuery and open-source frameworks like Spark to access data with fine-grained access control. BigLake provides accelerated query performance across multi-cloud storage and open formats such as Apache Iceberg. Store a single copy of data with uniform features across data warehouses & lakes. Fine-grained access control and multi-cloud governance over distributed data. Seamless integration with open-source analytics tools and open data formats. Unlock analytics on distributed data regardless of where and how it’s stored, while choosing the best analytics tools, open source or cloud-native over a single copy of data. Fine-grained access control across open source engines like Apache Spark, Presto, and Trino, and open formats such as Parquet. Performant queries over data lakes powered by BigQuery. Integrates with Dataplex to provide management at scale, including logical data organization.
    Starting Price: $5 per TB
  • 39
    Unfolded

    Unfolded

    Unfolded

    Transform your geospatial data into insightful maps within minutes. Use our extensive layer catalog and advanced timeline animation capabilities. Slice and dice your data using our intuitive geospatial analytic capabilities. Quickly arrive at insights through fluid in-browser exploration with immediate visual feedback. Publish your maps to your team with the click of a button. Make your own stories and share impactful data narratives with the world. An intuitive user experience that makes complex geospatial data science easy. Combine Shapefiles, Vector Tiles and Cloud-Optimized GeoTIFFs with traditional data formats like CSV and GeoJSON. Analyze your data by joining tables and grouping rows. Cross-filter data and correlate columns using custom metrics. Build polished web applications on top of your published maps. Iterate quickly with our well-documented and easy-to-use API. Perform geospatial joins across different geospatial data types.
  • 40
    RediSearch
    Redis Enterprise includes a powerful real-time indexing, querying, and full-text search engine available on-premises and as a managed service in the cloud. Redis real-time search supports fast indexing and ingestion. It’s engineered for performance using in-memory data structures implemented in C. Scale out and partition indexes over several shards and nodes for greater speed and memory capacity. Enjoy continued operations in any scenario with five-nines availability and Active-Active failover. Redis Enterprise real-time search allows you to quickly create primary and secondary indexes on Hash and JSON datasets using an incremental indexing approach for fast index creation and deletion. The indexes let you query data at top speed, perform complex aggregations, filter by properties, numeric ranges as well as geographical distance.
  • 41
    EraDB

    EraDB

    Era Software

    EraDB is a database architecture built on the core principles of decoupled storage and compute, true zero-schema data storage, and flexible indexing powered by machine learning, all of which allow you to significantly reduce the size, cost, and complexity of your data while still giving you lightning-fast queries across vast datasets. We automatically index on every dimension, so you never have to decide today what you want to query tomorrow EraDB is schemaless by design, so we can store your data regardless of whether it’s consistently-structured. Built for flexibility, EraDB supports pluggable front- and back-end systems. Technologically-limited storage engines cannot efficiently handle complex data, so they crash or slow to a crawl.
  • 42
    Microsoft Graph Data Connect
    Microsoft Graph is your organization's gateway to Microsoft 365 data for productivity, identity, and security. Microsoft Graph Data Connect enables developers to copy select Microsoft 365 datasets into Azure data stores in a secure and scalable way. It's ideal for training machine learning and AI models that uncover rich organizational insights and deliver new value to analytics solutions. Copy data at scale from a Microsoft 365 tenant and move it directly into Azure Data Factory without writing code. Get the data you need, delivered to your application on a repeatable schedule, in just a few simple steps. Control how your organization's data is accessed with the Microsoft Graph Data Connect granular consent model. It requires that developers specify exactly what types of data or filter content their application will access. Likewise, administrators must give explicit approval to access Microsoft 365 data before access is granted.
    Starting Price: $0.75 per 1K objects extracted
  • 43
    Google Cloud Data Fusion
    Open core, delivering hybrid and multi-cloud integration. Data Fusion is built using open source project CDAP, and this open core ensures data pipeline portability for users. CDAP’s broad integration with on-premises and public cloud platforms gives Cloud Data Fusion users the ability to break down silos and deliver insights that were previously inaccessible. Integrated with Google’s industry-leading big data tools. Data Fusion’s integration with Google Cloud simplifies data security and ensures data is immediately available for analysis. Whether you’re curating a data lake with Cloud Storage and Dataproc, moving data into BigQuery for data warehousing, or transforming data to land it in a relational store like Cloud Spanner, Cloud Data Fusion’s integration makes development and iteration fast and easy.
  • 44
    Swagger

    Swagger

    SmartBear

    Simplify API development for users, teams, and enterprises with the Swagger open source and professional toolset. Find out how Swagger can help you design and document your APIs at scale. The power of Swagger tools starts with the OpenAPI Specification — the industry standard for RESTful API design. Individual tools to create, update and share OpenAPI definitions with consumers. SwaggerHub is the platform solution to support OpenAPI workflows at scale. Swagger open source and pro tools have helped millions of API developers, teams, and organizations deliver great APIs. Swagger offers the most powerful and easiest to use tools to take full advantage of the OpenAPI Specification.
  • 45
    BackAnt

    BackAnt

    BackAnt

    BackAnt is an AI-native backend development tool designed to automatically generate production-ready APIs and backend infrastructure from simple specifications or prompts. It works primarily as a command-line interface that scaffolds complete backend applications using the Flask framework, allowing developers to generate fully structured projects in seconds instead of building them manually. When a user runs a generation command or provides a JSON specification describing the required endpoints and data structure, the system automatically creates the key components of a backend application, including API routes, business-logic services, data repositories, database models, and application startup configuration. The generated project follows a layered architecture that separates routing, business logic, and data access, helping maintain clean and scalable code organization.
    Starting Price: $15 per month
  • 46
    QuerySurge
    QuerySurge leverages AI to automate the data validation and ETL testing of Big Data, Data Warehouses, Business Intelligence Reports and Enterprise Apps/ERPs with full DevOps functionality for continuous testing. Use Cases - Data Warehouse & ETL Testing - Hadoop & NoSQL Testing - DevOps for Data / Continuous Testing - Data Migration Testing - BI Report Testing - Enterprise App/ERP Testing QuerySurge Features - Projects: Multi-project support - AI: automatically create datas validation tests based on data mappings - Smart Query Wizards: Create tests visually, without writing SQL - Data Quality at Speed: Automate the launch, execution, comparison & see results quickly - Test across 200+ platforms: Data Warehouses, Hadoop & NoSQL lakes, databases, flat files, XML, JSON, BI Reports - DevOps for Data & Continuous Testing: RESTful API with 60+ calls & integration with all mainstream solutions - Data Analytics & Data Intelligence:  Analytics dashboard & reports
  • 47
    Inflectiv

    Inflectiv

    Inflectiv

    Inflectiv is a data platform that converts raw files into structured datasets designed for AI agents and automation. Users can upload PDFs, documents, spreadsheets, JSON files, and websites. Inflectiv automatically structures this information so it can be queried through APIs, SDKs, or built-in chat agents. Instead of parsing unstructured documents, AI agents work directly with datasets that support filtering, querying, and reliable responses. Inflectiv supports building Q&A chatbots, Discord and Telegram bots, internal knowledge assistants, and dataset-powered applications. Datasets can be kept private, shared with teams, or published to the marketplace for others to use. Creators retain full ownership of their data and control access, permissions, and monetization. The platform is suitable for both technical and non-technical users who want to turn existing knowledge into reusable AI-ready intelligence without custom ingestion pipelines.
    Starting Price: $29.99
  • 48
    Bazze

    Bazze

    Bazze

    Bazze is an AI-powered intelligence targeting and early-warning platform that transforms vast unclassified commercial data into mission-relevant insights on demand. Its Commercial Data Infrastructure (CDI) marketplace delivers real-time and historical datasets, ranging from device locations and satellite imagery to open source intelligence, via a “query in place” API model, eliminating the need for bulk purchases. Users can discover and integrate data from an expanding array of sources, apply advanced filtering and proprietary intent scores, and visualize results through custom dashboards or export them for downstream analysis. Specialized tools include reverse DNS mapping, geospatial event detection, trend tracking, threat scoring, and similarity searches to identify related entities. Everything is updated continuously and delivered on a consumption basis to optimize resource allocation.
  • 49
    Galileo

    Galileo

    GISDATA

    Galileo by GISDATA.io revolutionizes the way geospatial data is accessed and utilized. By aggregating datasets from numerous sources into a single, user-friendly platform, Galileo offers advanced spatial and metadata-specific search capabilities, ensuring quick and precise access to the most relevant and up-to-date geospatial information. Designed for professionals in surveying, mapping, engineering, environmental consulting, and scientific consulting industries, Galileo enhances efficiency and decision-making by streamlining the data discovery process. The platform is powered by a proprietary data discovery and indexing pipeline that guarantees regular updates and comprehensive data coverage. With thousands of datasets, including 750,000 from ESRI servers, Galileo provides an extensive and robust data collection. Galileo stands out with its intuitive interface, making it accessible for both novice and experienced users, and its ability to save valuable time and resources.
    Starting Price: $5
  • 50
    Metal

    Metal

    Metal

    Metal is your production-ready, fully-managed, ML retrieval platform. Use Metal to find meaning in your unstructured data with embeddings. Metal is a managed service that allows you to build AI products without the hassle of managing infrastructure. Integrations with OpenAI, CLIP, and more. Easily process & chunk your documents. Take advantage of our system in production. Easily plug into the MetalRetriever. Simple /search endpoint for running ANN queries. Get started with a free account. Metal API Keys to use our API & SDKs. With your API Key, you can use authenticate by populating the headers. Learn how to use our Typescript SDK to implement Metal into your application. Although we love TypeScript, you can of course utilize this library in JavaScript. Mechanism to fine-tune your spp programmatically. Indexed vector database of your embeddings. Resources that represent your specific ML use-case.
    Starting Price: $25 per month