MarkLogic Data Hub Training Course
MarkLogic Data Hub is an open-source consolidated data repository that offers a set of tools and libraries to accelerate enterprise data integration and delivery.
This instructor-led, live training (online or onsite) is aimed at system administrators, database administrators, data architects, and developers who wish to install, configure, and manage MarkLogic Data Hub to organize and manage their data from various silos.
By the end of this training, participants will be able to customize, secure, track, and manage their enterprise data using MarkLogic Data Hub capabilities and tools.
Format of the Course
- Interactive lecture and discussion.
- Lots of exercises and practice.
- Hands-on implementation in a live-lab environment.
Course Customization Options
- To request a customized training for this course, please contact us to arrange.
Course Outline
Introduction
Overview of MarkLogic Data Hub Features and Architecture
Getting Started with MarkLogic Data Hub
Importing, Migrating, and Converting Existing Artifacts
Exploring MarkLogic Data Hub Concepts
Setting up Users, Roles, and Privileges
Deploying Security Configuration Using QuickStart and ml-gradle
Working with Data Ingestion and Flow Pipelines
Working with Steps, Mapping, and Modules
Configuring Project Steps and Flows
Understanding Key Semantic Data Modeling Concepts
Accessing Data Using JavaScript APIs and SPARQL
Managing Data on DHS Using Hub Central
Managing On-Premises Data, Projects, Flows, and Steps
Serving Data Out of MarkLogic Using REST and ODBC
Tracking the Data History and Data Lineage Origin
Replicating Existing Data Flow with a New Data Source
Using Smart Mastering with MarkLogic Data Hub
Troubleshooting
Summary and Conclusion
Requirements
- Experience with database management systems
- Familiarity with JavaScript, C, C++, or any other programming language
Audience
- System administrators
- Database administrators
- Data architects
- Developers
Open Training Courses require 5+ participants.
Testimonials
The variety of the information shared and the clarity to explain terms in plain English.
Arisbe Mendoza - Fairtrade International
Course - GDPR Workshop
It's a hands-on session.
Vorraluck Sarechuer - Total Access Communication Public Company Limited (dtac)
Course - Talend Open Studio for ESB
Related Courses
Data Ethics
14 Hours
Data Ethics focuses on the responsible collection, usage, and decision-making regarding data, ensuring that actions uphold human rights, privacy, transparency, and fairness.
This instructor-led live training (available online or onsite) is designed for public sector professionals who have limited or no prior training in data ethics, manage or govern data, and wish to understand ethical risks, evaluate real-world dilemmas, and apply principles of responsible data use in alignment with institutional values and public trust.
By the end of this training, participants will be able to:
- Define key concepts and frameworks in data ethics.
- Identify ethical risks and trade-offs in data collection, analysis, and deployment.
- Apply principles of transparency, consent, and fairness to real-world scenarios.
- Integrate ethical review into governance or operational workflows.
Format of the Course
- Interactive lecture and discussion.
- Hands-on analysis of real-world data ethics cases.
- Guided exercises focused on ethical evaluation and policy alignment.
Course Customization Options
- To request a customized training for this course based on your department's workflows or internal tools, please contact us to arrange.
Data Integrity and Availability
14 Hours
Data Integrity and Availability focuses on guaranteeing that information remains precise, complete, consistent, and accessible when required, particularly within high-trust public sector settings.
This instructor-led, live training (available online or onsite) targets public sector professionals tasked with managing or protecting data—regardless of their technical expertise—who aim to uphold the reliability, consistency, and accessibility of critical datasets and systems under their supervision.
Upon completing this training, participants will be able to:
- Define and distinguish the core principles of integrity and availability throughout the data lifecycle.
- Identify and mitigate data corruption, inconsistencies, or unauthorized changes.
- Architect data environments that support high availability and business continuity.
- Enforce policies and controls that sustain long-term data reliability.
Course Format
- Interactive lectures and discussions.
- Practical assessment of data risks and potential failure points.
- Guided exercises centered on policy creation and incident prevention.
Customization Options
- To arrange tailored training for this course based on your department's specific workflows or internal tools, please get in touch with us.
Data Policies and Standards
14 Hours
Data Policies and Standards represent a structured methodology to ensure that government data is created, maintained, accessed, and utilized in a manner that is consistent, secure, and aligned with legal and ethical guidelines.
This instructor-led, live training (available online or onsite) is designed for public sector professionals responsible for establishing or applying data policies—regardless of their technical background—who aim to standardize, document, and enforce data practices across departments or systems.
Upon completion of this training, participants will be able to:
- Define and distinguish between data policies, standards, and procedures.
- Draft and assess data governance policies in alignment with national and international frameworks.
- Promote consistent and high-quality data practices across teams and departments.
- Establish a foundation for compliance, audit readiness, and trustworthy data systems.
Course Format
- Interactive lectures and discussions.
- Hands-on drafting of sample policies and standards.
- Guided evaluation of existing data workflows and controls.
Course Customization Options
- To request a customized training session tailored to your department's workflows or internal tools, please contact us to arrange.
Data Strategy
14 Hours
A Data Strategy constitutes the long-term blueprint for how an organization manages, leverages, and invests in its data assets to advance its mission, enhance public services, and maintain accountability.
This instructor-led live training, available both online and onsite, targets public sector professionals who have limited or nascent experience with data strategy. It is designed for those who shape or influence strategic decisions and aim to establish sustainable, mission-aligned data strategies across their organization or department.
Upon completion of this training, participants will be equipped to:
- Articulate the key components of a comprehensive data strategy.
- Synchronize data initiatives with organizational goals and public value creation.
- Create roadmaps covering data governance, infrastructure, skill development, and innovation.
- Assess organizational maturity and track progress toward becoming a data-driven entity.
Course Format
- Interactive lectures and discussions.
- Practical exercises in developing strategy components and roadmaps.
- Guided analysis of public sector case studies and strategic frameworks.
Course Customization Options
- To arrange a customized training session tailored to your department's specific workflows or internal tools, please contact us.
EBX5 for Developers
21 Hours
This instructor-led, live training in Italy (online or onsite) is intended for developers who wish to use EBX5 (TIBCO EBX) to enable a Master Data Management solution within their organization.
By the end of this training, participants will be able to:
- Interpret requirements and architect an MDM solution.
- Enable the management and integration of master data.
- Integrate and transfer data across multiple systems.
- Import data into EBX5 using match and merge logic.
- Design, create and document a data model that addresses their organization's business requirements.
- Integrate EBX5 with third-party services.
GDPR Workshop
7 Hours
Achieve mastery of the General Data Protection Regulation fundamentals through an immersive one-day workshop tailored for managers, department heads, and compliance professionals. The curriculum covers GDPR basics, rights of data subjects, core data protection principles, consent requirements, obligations for breach notification, and the concept of privacy by design. Participants will gain practical frameworks to implement GDPR compliance strategies throughout their organizations, ensuring lawful data processing and fostering a culture of accountability regarding data protection.
How to Audit GDPR Compliance
14 Hours
This program is specifically designed for auditors and administrative personnel responsible for verifying that their control systems and IT infrastructures align with current legal and regulatory standards. The curriculum starts by clarifying fundamental GDPR principles and illustrating their impact on auditing practices. Participants will examine the rights of data subjects, the obligations of data controllers and processors, and key aspects of enforcement and compliance under the Regulation. Additionally, the training includes an audit framework provided by ISACA, empowering auditors to evaluate GDPR governance structures, response mechanisms, and supporting processes to mitigate risks linked to non-compliance.
Oracle GoldenGate
14 Hours
This instructor-led, live training in Italy (online or onsite) is designed for system administrators and developers who want to establish, deploy, and manage Oracle GoldenGate for data transformation purposes.
Upon completing this training, participants will be able to:
- Install and configure Oracle GoldenGate.
- Understand Oracle database replication using the Oracle GoldenGate tool.
- Grasp the architecture of Oracle GoldenGate.
- Configure and execute database replication and migration tasks.
- Optimize Oracle GoldenGate performance and resolve technical issues.
Personal Data Protection Officer - Basic Level
21 Hours
Training Objectives
- Familiarizing participants with the systematic and comprehensive framework of personal data protection under Polish and European legislation.
- Delivering practical insights into the updated regulations governing personal data processing.
- Highlighting key areas of legal risk associated with the implementation of the GDPR.
- Preparing participants to independently perform the duties of a Personal Data Protection Officer.
Personal Data Protection Officer - Advanced Level
14 Hours
Training Objectives
- Gaining practical knowledge on how to perform the tasks of the Data Protection Officer.
- Gaining practical knowledge of auditing procedures and risk assessment methods.
- Providing practical insights into the new rules for the processing of personal data.
Privacy in Federal Institutions (Requirements under the Privacy Act)
7 Hours
The course 'Privacy in Federal Institutions' serves as a foundational program centered on the Privacy Act and its mandates for safeguarding personal information within government operations.
This live, instructor-led training—available online or in-person—is designed for public sector professionals who have limited or developing expertise in privacy legislation. It is intended for those who manage or process citizen data and seek to ensure compliance with the Privacy Act and associated federal standards.
Upon completion of this training, participants will be able to:
- Grasp the key provisions and principles of the Privacy Act.
- Recognize personal information and manage it in line with legal obligations.
- Establish and implement privacy-compliant practices in daily operations.
- Effectively address requests for access to information and corrections.
Course Format
- Interactive lectures and discussions.
- Practical application of policy scenarios relevant to the public sector.
- Guided exercises emphasizing compliance, documentation, and reporting.
Course Customization Options
- To arrange customized training tailored to your department's workflows or internal tools, please contact us.
Talend Administration Center (TAC)
14 Hours
This instructor-led, live training in Italy (online or onsite) is designed for system administrators, data scientists, and business analysts who wish to set up Talend Administration Center to deploy and manage organizational roles and tasks.
By the end of this training, participants will be able to:
- Install and configure Talend Administration Center.
- Understand and implement core Talend management principles.
- Build, deploy, and execute business projects or tasks within Talend.
- Monitor data security and develop business routines based on the TAC framework.
- Gain a broader understanding of big data applications.
Talend Big Data Integration
28 Hours
This instructor-led, live training in Italy (online or onsite) is aimed at technical professionals who wish to deploy Talend Open Studio for Big Data to simplify the process of reading and crunching through Big Data.
By the end of this training, participants will be able to:
- Install and configure Talend Open Studio for Big Data.
- Connect with Big Data systems such as Cloudera, Hortonworks, MapR, Amazon EMR and Apache.
- Understand and set up Open Studio's big data components and connectors.
- Configure parameters to automatically generate MapReduce code.
- Use Open Studio's drag-and-drop interface to run Hadoop jobs.
- Prototype big data pipelines.
- Automate big data integration projects.
Talend Data Stewardship
14 Hours
This instructor-led live training in Italy (online or onsite) targets beginner to intermediate data analysts aiming to deepen their expertise in managing and enhancing data quality using Talend Data Stewardship.
By the conclusion of this training, participants will be able to:
- Acquire a comprehensive grasp of the role data stewardship plays in maintaining high data quality.
- Apply Talend Data Stewardship for the management of data quality tasks.
- Create, assign, and manage tasks within Talend Data Stewardship, including customizing workflows.
- Utilize the tool’s reporting and monitoring features to track data quality and stewardship activities.
Talend Open Studio for ESB
21 Hours
In this instructor-led, live training in Italy, participants will learn how to use Talend Open Studio for ESB to create, connect, mediate and manage services and their interactions.
By the end of this training, participants will be able to:
- Integrate, enhance and deliver ESB technologies as single packages in a variety of deployment environments.
- Understand and utilize Talend Open Studio's most commonly used components.
- Integrate any application, database, API, or web service.
- Seamlessly integrate heterogeneous systems and applications.
- Embed existing Java code libraries to extend projects.
- Leverage community components and code to extend projects.
- Rapidly integrate systems, applications and data sources within a drag-and-drop Eclipse environment.
- Reduce development time and maintenance costs by generating optimized, reusable code.