Back to Projects

Sovware Data Mandiri - Big Data Infrastructure

Enterprise-scale Big Data architecture with distributed systems, observability, and modern CI/CD practices supporting on-premise and cloud environments.

Sovware Data Mandiri - Big Data Infrastructure - Image 1
0 views
4 min read

Project Overview

At Sovware Data Mandiri, I served as a Junior System Administrator responsible for designing, implementing, and maintaining a comprehensive Big Data infrastructure. This role involved architecting scalable distributed systems, building observability solutions, implementing modern CI/CD practices, and establishing secure authentication mechanisms. The infrastructure supports both on-premise and cloud environments, handling complex data processing and storage requirements for enterprise operations.


Company & Role Context

Sovware Data Mandiri is an enterprise data solutions company based in Bandung, Indonesia. As a Junior System Administrator in a hybrid work environment, I was tasked with building and maintaining the technical foundation that enables the company's data processing and analytics capabilities.

The role encompassed:

  • Infrastructure design and implementation
  • System administration and operations
  • DevOps and CI/CD pipeline management
  • Security and authentication infrastructure
  • Monitoring and observability
  • Performance optimization and troubleshooting

My Responsibilities

As the primary infrastructure engineer, my key responsibilities included:

  • Designing Big Data architecture for scalability, high availability, and fault tolerance
  • Deploying and managing distributed systems on Proxmox hypervisor
  • Building and maintaining comprehensive monitoring and observability stacks
  • Developing and maintaining CI/CD pipelines using GitLab
  • Implementing centralized authentication and authorization systems
  • Configuring secure VPN services for infrastructure connectivity
  • Leading the modernization of deployment processes

Big Data Architecture & Technology Stack

Storage & Data Processing

The infrastructure leverages multiple technologies for distributed storage and processing:

  • MinIO

    • S3-compatible object storage for data lakes
    • High-performance distributed storage
  • Ceph

    • Distributed storage system for block and object storage
    • Provides redundancy and fault tolerance
  • Apache Iceberg & Nessie

    • Open table format for data lakes
    • ACID transactions and time-travel queries
  • Apache Parquet & Avro

    • Columnar and row-based data formats
    • Optimized for analytics and compression

Query & Analytics Engines

  • Dremio

    • Self-service analytics platform
    • Unified query engine across data sources
  • Trino (formerly PrestoSQL)

    • Distributed SQL query engine
    • Multi-source querying capabilities
  • Apache Hue

    • Web interface for data exploration
    • Query editor and visualization tools

Coordination & Streaming

  • Apache Zookeeper

    • Distributed coordination service
    • Cluster management and leader election
  • Apache NiFi

    • Data routing and transformation
    • Real-time data flow management

Data Visualization

  • Superset

    • Modern data visualization and dashboarding
    • Self-service analytics platform
  • Metabase

    • Business intelligence and analytics
    • User-friendly query builder

Infrastructure & Virtualization

  • Proxmox

    • Hypervisor for virtual machine management
    • KVM-based virtualization platform
  • Docker

    • Containerization for application deployment
    • Consistent environments across infrastructure

Networking & Load Balancing

  • Nginx

    • Reverse proxy and web server
    • Load balancing and API gateway
  • HAProxy

    • High-performance load balancer
    • Advanced routing and failover capabilities
  • Keepalived

    • Virtual IP management
    • High availability for load balancers

Database

  • MariaDB Galera Cluster

    • Multi-master replication
    • Synchronous replication for consistency
    • Active-active clustering

Observability & Monitoring Stack

I built a comprehensive observability solution to ensure system reliability and performance:

  • Prometheus

    • Metrics collection and time-series database
    • Scraping metrics from all infrastructure components
  • Grafana

    • Visualization and dashboarding
    • Real-time monitoring and alerting
  • Thanos

    • Long-term metrics storage
    • Global query view across Prometheus instances
  • Loki

    • Log aggregation and indexing
    • Centralized logging for all services
  • Alloy

    • Unified observability agent
    • Metrics, logs, and traces collection

This stack provides:

  • Real-time system performance monitoring
  • Centralized log aggregation and analysis
  • Historical metrics retention and analysis
  • Alerting and incident response capabilities
  • Distributed tracing for troubleshooting

CI/CD Pipeline & Deployment Automation

GitLab CI/CD Implementation

I designed and implemented a modern CI/CD pipeline using GitLab:

  • GitLab Runner

    • Distributed CI/CD executor
    • Docker-based job execution
  • GitLab Container Registry

    • Private container image repository
    • Secure artifact storage
  • Automated Workflows

    • Build automation
    • Testing and validation
    • Containerized artifact production
    • Semantic versioning

Deployment Process Modernization

Led the transition from manual deployment processes to automated pipelines:

  • Before: Manual SCP/SFTP-based deployments and manual tagging
  • After: Automated CI/CD with Semantic Versioning and Conventional Commits

Benefits achieved:

  • Improved release consistency
  • Reduced deployment errors
  • Faster time-to-production
  • Better version tracking and rollback capabilities
  • Enhanced team collaboration

Security & Authentication

Single Sign-On (SSO) Implementation

Implemented a centralized authentication infrastructure using Keycloak:

  • Keycloak

    • Identity and access management platform
    • OAuth 2.0 and OpenID Connect (OIDC) support
    • User federation and role-based access control
  • OAuth 2.0 & OIDC

    • Secure authentication protocol
    • Token-based authorization
    • Integration with multiple applications

VPN Services

Configured secure VPN services for infrastructure connectivity:

  • L2TP (Layer 2 Tunneling Protocol)

    • Tunneling protocol for VPN
    • Secure remote access
  • IPsec (Internet Protocol Security)

    • Encryption and authentication
    • Secure site-to-site connectivity
  • OpenVPN

    • Open-source VPN solution
    • Flexible and secure remote access
  • OpenSSL

    • Cryptographic toolkit
    • Certificate management and encryption

Key Achievements

Throughout my tenure at Sovware Data Mandiri, I achieved:

  • Designed and deployed a production-grade Big Data infrastructure supporting enterprise-scale operations
  • Built a comprehensive observability stack enabling proactive monitoring and incident response
  • Modernized deployment processes, reducing manual errors and improving consistency
  • Implemented enterprise-grade security with centralized SSO and VPN services
  • Established best practices for infrastructure as code and configuration management
  • Enabled the team to scale infrastructure efficiently while maintaining reliability

Challenges & Experience

Key challenges during this role included:

  • Managing complexity of distributed systems across multiple technologies
  • Ensuring high availability and fault tolerance in production environments
  • Balancing performance optimization with resource constraints
  • Coordinating infrastructure changes with minimal downtime
  • Learning and mastering multiple enterprise technologies simultaneously
  • Troubleshooting complex issues in distributed systems

These challenges provided invaluable experience in enterprise infrastructure management and problem-solving.


What I Learned

Through this role at Sovware Data Mandiri, I gained extensive experience in:

  • Designing and implementing enterprise-scale Big Data architectures
  • Managing distributed systems and ensuring high availability
  • Building comprehensive observability and monitoring solutions
  • Implementing modern CI/CD practices and automation
  • Security infrastructure including SSO and VPN services
  • Infrastructure as code and configuration management
  • Performance optimization and capacity planning
  • Troubleshooting complex distributed system issues
  • Leadership in modernizing legacy processes
  • Working with cutting-edge data technologies

Why This Project Matters

This role is significant in my career because it:

  • Represents full-time position in infrastructure and DevOps
  • Demonstrates my ability to design and implement enterprise-scale systems
  • Shows my capability to lead technical modernization initiatives
  • Reflects my expertise in observability and monitoring
  • Combines infrastructure, security, and automation expertise
  • Showcases my ability to work with complex, distributed technologies

Conclusion

My tenure at Sovware Data Mandiri as a Junior System Administrator was transformative in developing my infrastructure and DevOps expertise. By designing and implementing a comprehensive Big Data infrastructure, building observability solutions, modernizing CI/CD practices, and establishing secure authentication systems, I gained practical experience in enterprise-scale system administration.

This role reinforced the importance of reliability, scalability, and observability in production systems—principles that continue to guide my approach to infrastructure and DevOps work today.

More or Related Projects

BuildWithAngga Backend
November 30, 2024

BuildWithAngga Backend

A Laravel-based backend system for an educational platform, featuring saved articles, clean architecture, and comprehensive testing practices.

00
Tools:
+5
PT Citra Jelajah Informatika - Network Engineer & System Administrator
October 31, 2023

PT Citra Jelajah Informatika - Network Engineer & System Administrator

Network engineer and system administrator experience managing enterprise infrastructure including DNS, web servers, databases, mail servers, and network systems across multiple environments.

00
Tools:
+14
Express Open Telemetry
September 20, 2025

Express Open Telemetry

A sample Node.js Express application instrumented with OpenTelemetry, integrated with Language SDK, Grafana Tempo, Grafana, and PostgreSQL, running with Docker Compose.

0000
Tools:
+3