Top Machine Learning Data Catalog Software - United States

Machine learning data catalogs enable organizations to organize, access, interpret, and collaborate around data from multiple sources while ensuring robust governance and access control. Artificial intelligence plays a central role in many features of these catalogs, supporting capabilities like machine learning-based recommendations, natural language queries, and dynamic data masking for improved security.

These catalogs allow businesses to consolidate datasets in a single location, making it easier for both analysts and everyday users to search for and discover data. Users can comment on, share, and recommend datasets, providing immediate context for colleagues who are querying the data. IT administrators can implement user provisioning to prevent unauthorized access to sensitive information.

Machine learning data catalogs are particularly beneficial for companies that have diverse data sources, seek a unified source of truth, and aim to scale data usage across the organization. While IT departments typically manage these platforms to maintain organization and security, the catalogs are designed to be accessible to data scientists, analysts, and even non-technical business users.

Data can be transformed, modeled, and visualized either within the catalog itself or through integration with business intelligence tools. Not all machine learning data catalogs include data preparation features, however; those that lack them may require integration with business intelligence platforms for such capabilities. Additionally, these catalogs differ from master data management (MDM) systems in their focus on enhanced governance, collaboration, and machine learning-powered functionalities.


Appen

appen.com

Unlock Generative AI with Appen. Power exceptional customer experiences with our industry-leading products, depth of expertise and unmatched global team of AI Training Specialists. We’re your trusted data partner, enabling the most innovative companies to execute world-class AI initiatives.

TextQL

textql.com

TextQL serves as a personal, virtual data analyst designed for enterprises. This AI-driven platform allows users to seek business insights through natural English queries. TextQL's technology, embodied by the AI named Ana, constructs comprehensive analyses, creates visual representations of data, and generates robust models. The unique feature of TextQL is its integration within a team's preexisting data platforms, which enables Ana to function where the team is already active. This includes collaborating through platforms like Slack and Teams.

TextQL finds its usability across business intelligence systems, serving as a primary point of contact to locate any metric or dashboard. Moreover, it prevents redundancy in dashboard creation by retrieving any existing dashboard. Ana can also manage an enterprise's entire data catalog. It can index various locations where messy metadata might be stored, surface definitions from any stored location with verified links, and recognize different definitional uses across teams.

TextQL employs a large language model fluent in SQL and Python and can be configured to adhere to any compliance standard. This allows for secure and compliant deployments. Workflows are designed to suit an organization's needs, and industry-leading guardrails enable data anonymization, ensuring privacy. This makes TextQL a powerful tool with expansive data integration, analysis, and management capabilities that cater to various industries.

Sama AI

sama.com

Sama is a globally recognized leader in data annotation solutions for enterprise AI models that require the highest accuracy. We are the only computer vision solutions company with an in-house expert workforce using its own enterprise-grade platform. Our mission is to accelerate and advance computer vision AI development by providing the most accurate, scalable, and ethical data pipeline. Ethical AI is responsible AI, and as a Certified B-Corp, we’ve pioneered an impact model that harnesses the power of markets for social good and has been proven to meaningfully improve employment and income outcomes for those with the greatest barriers to formal work. So far, we have helped more than 60,000 people lift themselves out of poverty.

Shaip

shaip.com

Shaip provides high-quality data across multiple data types (text, audio, image, and video) to companies looking to build unbiased, high-quality AI/ML models. Shaip licenses, collects, and annotates data for Healthcare, Conversational AI, Computer Vision, and Generative AI/LLM use cases. Going beyond data, Shaip offers a complete Responsible LLM Toolkit to align, evaluate, and enhance large language models using reinforcement learning from human feedback (RLHF). Headquartered in Kentucky with offices in Silicon Valley and India, our global team blends data science expertise with deep industry knowledge. We enable partners to deploy AI that they can trust and that reflects the diversity of the people it impacts.

Denodo

denodo.com

We enable organizations to connect to all of their data in real time. Denodo is the leader in logical data fabric powered by data virtualization, providing data access, data governance, and data delivery capabilities across the broadest range of enterprise, cloud, big data, and unstructured data sources without moving the data from their original repositories. Denodo’s customers across every major industry have gained significant business agility and ROI. The Denodo Platform offers an active data catalog for semantic search and enterprise-wide data governance, industry-leading smart query acceleration powered by AI, automated cloud infrastructure management for multi-cloud and hybrid deployments, and embedded data preparation capabilities for self-service yet well-governed and secure analytics. Denodo provides a unique approach to data integration and management not found in any other technology. Denodo customers reported an 83% increase in business user productivity, a 67% reduction in data preparation effort, and a 65% decrease in data delivery time vs. ETL, resulting in a three-year benefit of $6.8M, an ROI of 408%, and payback within six months.

CastorDoc

castordoc.com

CastorDoc is a collaborative, automated data discovery & catalog tool. We believe that data people spend way too much time trying to find and understand their data. CastorDoc redesigns how data people collaborate. It provides a single source of truth to reference and document all the knowledge related to data within your company. If you are looking for a table related to your customers, just look for it as you would in Google, and CastorDoc provides you with all the context you will need for your analysis. Inspired by internal tools developed by Uber, Airbnb, Lyft, and Spotify, CastorDoc has developed a plug-and-play solution that deploys in minutes to drive value for companies of all sizes. Discover and catalog your data today with CastorDoc.

data.world

data.world

data.world is the most-adopted data catalog and governance platform on the market. Built on a unique knowledge graph foundation, data.world seamlessly integrates with your existing systems. We set the standard for swift, people-centric governance. We don't just manage data; we unlock its potential, paving the way for responsible AI adoption and data-driven decision-making at scale. data.world is a Certified B Corporation and public benefit corporation and home to the world’s largest collaborative open data community with more than two million members, including ninety percent of the Fortune 500.

Collibra

collibra.com

Collibra is a data catalog platform and tool that helps organizations better understand and manage their data assets. Collibra helps create an inventory of data assets, capture information (metadata) about them, and govern these assets. At its core, the Collibra tool is used for helping stakeholders understand what data assets exist, what they are made of, how they are being used, and if they are in regulatory compliance. Collibra unites your entire organization with trusted data that's easy to find, understand and access so you can do more with your data. And with new artificial intelligence (AI) use cases taking shape every day, AI governance is more critical than ever. Learn how you can start your AI governance journey with Collibra.

Collibra has four major functional areas:

* Data catalog: This module provides an inventory of data assets and allows users to find and discover the right assets to use for different purposes. Users can search across several different facets of the data assets.
* Data governance: The governance capabilities help create a common understanding of and sharing information about data assets. This includes both technical metadata and user-added information.
* Data lineage: Allows users to see how data assets are created and molded as they move from system to system. Lineage helps data owners track what makes up a data asset for compliance and users to see where an asset comes from and how it is shaped.
* Data privacy: The privacy module allows privacy and security teams to create, manage and run policies to ensure data privacy and compliance. Policy workflows can be initiated and compliance data and reports are captured.

Secoda

secoda.co

Secoda is the fastest way to explore, understand, and use data. Companies like Chipotle, Cardinal Health, Kaufland, and Remitly use Secoda to get visibility into the health of their entire stack, reduce costs, and help their data teams run more efficiently. Powered by AI, Secoda creates a single source of truth for an organization’s data by connecting to all data sources, models, pipelines, databases, warehouses, and visualization tools. Secoda consolidates multiple tools into a single data management platform to simplify your data catalog, metadata management, lineage, governance, monitoring, and observability processes. Regardless of technical ability, it is the easiest way for any data or business stakeholder to turn their insights into action.

Workstream.io

workstream.io

Workstream simplifies access to your analytic assets, ensuring users can confidently impact business outcomes by turning the massive scale of data in your organization into knowledge that teams can act on.

Dataland

dataland.io

Next-Gen Internal Tools. 10x simpler to build and maintain. 10x faster for getting things done. Infinitely scalable. Dataland is the easiest way to deliver high-quality internal tools to your business users. It's secure, easy-to-use, and sets up in minutes.

Decube

decube.io

Decube is a data observability platform that helps data teams better understand the health of data in their system and prevent data quality incidents. Decube provides three core modules: automated data quality monitoring, metadata discovery through data cataloging, and data diff checking through its data reconciliation feature. It includes no-code connections to popular data sources such as PostgreSQL and Google BigQuery.

Select Star

selectstar.com

Select Star is an intelligent metadata platform that automatically analyzes and documents your data. With data catalog, lineage, usage analysis, and AI assistants, Select Star provides an easy-to-use data portal where data teams can govern and manage their data with automation. Today, data teams use Select Star as a co-pilot for data governance, data migration, self-service analytics and data democratization, and cost optimization initiatives.

Erisna

erisna.com

Erisna is an enterprise data catalog and discovery platform that enables data analysts, data engineers, data scientists, and data managers to get the most out of their data. Connect Erisna to various data sources such as Amazon Redshift, Google BigQuery, Microsoft Azure Synapse, Snowflake, PostgreSQL, and SQL Server to build your data dictionary, auto-detect sensitive data, automate data discovery, gather data pipeline requirements and improve data governance, all in one place. Our platform helps organizations increase productivity, reduce regulatory risks, make better decisions, and reduce costs significantly. Create your Erisna account and request a demo today!

Traceye

traceye.io

Traceye is an enterprise-grade data indexing infrastructure platform for building and deploying subgraphs with best-in-class performance, security, and scalability. Experience faster, seamless access to indexed blockchain data with Traceye subgraphs.

© 2025 WebCatalog, Inc.