25 06
2025
The time has come – you’re ready to build your data catalog. A big step towards data governance for your company. But how do you communicate the requirements for your data needs to make sure you get it right? Well, you definitely need to nail them down internally, first. And that can already be a complicated task.
In this article, we’ll talk about just that, including how to draft an RFP and select a vendor for your data catalog needs.
Before you can choose the right data catalog, you need a clear understanding of what your company actually needs from one.
So, before you look at the available data catalog tools, take a step back and look at your data management and data governance goals in detail.
Here’s how to evaluate tools and create an RFP that will attract the right vendors.
This is absolutely essential: start by gathering input from stakeholders across the company, from data analysts and engineers to compliance officers and business leaders.
Ask questions like:
Document both functional and non-functional requirements, including usability, scalability, integrations, and performance expectations.
Take stock of your current data sources, data platforms, and data governance tools. Your data catalog needs to integrate with all of them, including data lakes, warehouses, BI tools, and possibly even machine learning platforms.
Determine which metadata sources must be ingested, and in what formats. Consider:
Murdio tip: All of the above will heavily impact the cost of your data catalog. So, take a really careful inventory of all your data and the data quality. It’s where you can make significant savings, for example, by removing obsolete or duplicate data.
And if you work with us, we can make sure your data landscape (and data catalog costs) are optimized. We go into more detail about this in our recent article: Data Catalog Pricing.
Not all data catalog features are equally critical for every company. Prioritize yours based on your business goals and the data catalog benefits you expect, for instance:
Use your checklist to objectively compare vendors, making sure you’re getting the features you need.
Murdio tip: If you have doubts about the features you need, or are not really sure which ones are essential, and which are not, we can help. Schedule a conversation with a Murdio expert, and we can help you evaluate your internal needs and translate them into optimal features.
With your internal requirements clear, you can now build a Request for Proposal (RFP) to send to data catalog vendors. Here are some things to include:
The document should encourage vendors to tailor their proposals to your unique context.
Murdio tip: Again, you can talk to us first, so that the RFP is super specified, and we can already recommend some solutions.
As proposals come in, rank them against your feature checklist. Invite shortlisted candidates for a hands-on demo. Don’t forget to:
A well-organized evaluation process will help you avoid getting swayed by marketing and remain grounded in real, operational needs.
An enterprise data catalog is more than a metadata repository. It plays a central role in maintaining data security, privacy, and compliance, especially in regulated industries. Here are some of the essential security and compliance features to evaluate.
You should be able to define who can view, edit, and manage different types of metadata. Look for:
A robust data catalog should support automated or manual tagging of sensitive data, such as:
This tagging is critical for meeting privacy regulations and internal data handling policies.
Depending on your industry and geography, your data catalog should help you align with:
Check if the tool offers built-in compliance templates or reporting capabilities.
Although a data catalog doesn’t store actual data, it may expose metadata that reveals sensitive information. So, make sure that:
Data lineage helps demonstrate where data came from, how it transformed, and who interacted with it. This is essential not only for compliance audits but also for internal data governance.
When you’re choosing your data catalog software, data lineage will most likely be an add-on feature, so keep that in mind.
Read more: Data Catalog vs. Data Lineage.
Some data catalog best practices include user activity monitoring and detecting unusual behavior patterns that might signal a security issue.
Here’s an example of how we helped one of our clients catalog sensitive data to comply with FINMA Circular 2023/01: Case Study: Management and cataloging sensitive critical data elements in a Swiss bank
Even the most feature-rich data catalog can fall short if you don’t account for certain implementation challenges. Here are some of the most common roadblocks and how to plan for them.
If your metadata is outdated, incomplete, inconsistent, or scattered across silos, ingesting it into a centralized catalog won’t magically fix the problem. Plan to invest time in:
Not all catalogs support every data source out of the box. You might need to build custom connectors or adapt APIs (and we can help with custom development).
Some questions to ask:
A data catalog only delivers value if people actually use it. Challenges to anticipate include:
To overcome these:
As your data landscape grows, your catalog should scale with it. So, when comparing vendors, understand the following:
We recommend choosing a tool with open APIs, standard data models, and flexible deployment options. This will reduce the risk and cost of switching vendors in the future.
Once you’ve selected a data catalog, it’s time to plan for a smooth rollout. Here’s what to expect in terms of timeline and the people and tools you’ll need.
The timeline can vary depending on your organization’s size, data landscape complexity, and scope of implementation, so there’s no point in providing time estimates without knowing your requirements.
But here’s a (very) general breakdown of the process and milestones if you decide to work with Murdio and hire a Technical Implementation Team.
Now, this timeline might look different if you decide to:
Also, whether you have already purchased a data catalog tool or are only just figuring out your options will obviously also impact the implementation timeline.
“A good tip is to start small and go for the low-hanging fruit to gain momentum and show both value and quick results for the organization. 2-4 weeks should get you the POC. Another 2-4 weeks – an MVP. If you can’t get your data catalog completely set up within 3-4 months, you might be doing something wrong. Then, better call Murdio. 🙂”
Stan Radomiński, Data Governance Consultant @ Murdio, Collibra Ranger
Successful implementation requires collaboration across departments. Depending on the tool you use for your data catalog and whether you work with a vendor like Murdio to help you implement the data catalog, the key roles within your team include:
You may also need support from external consultants, especially for larger or more complex deployments. If you work with Murdio, we’ll make sure the team composition is the most efficient in your case, helping you optimize costs according to the budget you have available.
For example, when you implement a Collibra data catalog, you can work with:
If you want to know what setup is the most optimal for your data catalog project, let’s have a look together, first.
Make sure you have the necessary infrastructure to support your data catalog tool:
Also, confirm whether you have a budget for ongoing licensing, training, and support.
Apart from selecting the right data governance tools, evaluating data catalog requirements will let you set a solid foundation for effective data governance, discoverability, and compliance.When you clearly define your business needs, understand the technical landscape, and plan for implementation, you increase your chances of success.
Don’t rush the process, though. Involve the right stakeholders, ask tough questions, and be realistic about the challenges.
And if you need expert support, drop us a line, we’d love to help!
A data catalog helps organizations organize, manage, and discover metadata across diverse data sources. Its main purpose is to improve data accessibility, transparency, and governance.
Implementation can take anywhere from roughly 8 to 16 weeks, depending on factors such as data volume, number of integrations, and organizational readiness. This is arbitrary, though, as you need to take into account things like your team makeup and implemented features. In most cases, ongoing optimization continues after the initial rollout – or at least, that should be the case.
The most important data catalog features are automated data discovery with powerful search capabilities and end-to-end data lineage, allowing users to easily find relevant data and understand its origins.
Crucially, the platform must provide robust tools to operationalize the entire data governance process by enforcing security policies and compliance, while building trust through a collaborative business glossary and user certifications.
A data catalog impacts data governance processes by providing a unified control layer across a company’s entire modern data stack. Choosing a data catalog with robust integration and automated lineage is critical for success.
This approach fundamentally transforms the process of managing data by embedding accountability, security, and trust directly into the daily workflow, ensuring all data assets are used correctly and compliantly.
A data dictionary provides definitions and schema-level information about data elements. A data catalog, on the other hand, is broader, offering metadata management, data lineage, access control, and search capabilities.
Your organization likely needs a data catalog solution when data teams spend more time searching for trustworthy data than using it for analysis, leading to significant inefficiencies and a lack of data-driven decision-making.
A primary indicator is the difficulty in data discovery and data access. If your data scientists and analysts constantly ask “Where can I find…?” or “Is this data still relevant?”, it’s a clear sign that valuable time is being wasted.
Yes. Even smaller organizations benefit from centralized metadata, improved data discovery, and better governance. Many vendors offer scalable or lightweight solutions that fit smaller teams and budgets.
While you may not need a large team, at least a few dedicated roles, such as data stewards and administrators, are essential for managing metadata quality, onboarding users, and driving adoption.
At Murdio, we can help you define the most optimal setup and roles and create automated workflows that minimize the number of team members involved (which can also significantly bring down costs).
© 2025 Murdio - All Rights Reserved - made by Netwired