-
Notifications
You must be signed in to change notification settings - Fork 0
Expand file tree
/
Copy pathindex2.html
More file actions
1394 lines (1349 loc) · 149 KB
/
index2.html
File metadata and controls
1394 lines (1349 loc) · 149 KB
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
138
139
140
141
142
143
144
145
146
147
148
149
150
151
152
153
154
155
156
157
158
159
160
161
162
163
164
165
166
167
168
169
170
171
172
173
174
175
176
177
178
179
180
181
182
183
184
185
186
187
188
189
190
191
192
193
194
195
196
197
198
199
200
201
202
203
204
205
206
207
208
209
210
211
212
213
214
215
216
217
218
219
220
221
222
223
224
225
226
227
228
229
230
231
232
233
234
235
236
237
238
239
240
241
242
243
244
245
246
247
248
249
250
251
252
253
254
255
256
257
258
259
260
261
262
263
264
265
266
267
268
269
270
271
272
273
274
275
276
277
278
279
280
281
282
283
284
285
286
287
288
289
290
291
292
293
294
295
296
297
298
299
300
301
302
303
304
305
306
307
308
309
310
311
312
313
314
315
316
317
318
319
320
321
322
323
324
325
326
327
328
329
330
331
332
333
334
335
336
337
338
339
340
341
342
343
344
345
346
347
348
349
350
351
352
353
354
355
356
357
358
359
360
361
362
363
364
365
366
367
368
369
370
371
372
373
374
375
376
377
378
379
380
381
382
383
384
385
386
387
388
389
390
391
392
393
394
395
396
397
398
399
400
401
402
403
404
405
406
407
408
409
410
411
412
413
414
415
416
417
418
419
420
421
422
423
424
425
426
427
428
429
430
431
432
433
434
435
436
437
438
439
440
441
442
443
444
445
446
447
448
449
450
451
452
453
454
455
456
457
458
459
460
461
462
463
464
465
466
467
468
469
470
471
472
473
474
475
476
477
478
479
480
481
482
483
484
485
486
487
488
489
490
491
492
493
494
495
496
497
498
499
500
501
502
503
504
505
506
507
508
509
510
511
512
513
514
515
516
517
518
519
520
521
522
523
524
525
526
527
528
529
530
531
532
533
534
535
536
537
538
539
540
541
542
543
544
545
546
547
548
549
550
551
552
553
554
555
556
557
558
559
560
561
562
563
564
565
566
567
568
569
570
571
572
573
574
575
576
577
578
579
580
581
582
583
584
585
586
587
588
589
590
591
592
593
594
595
596
597
598
599
600
601
602
603
604
605
606
607
608
609
610
611
612
613
614
615
616
617
618
619
620
621
622
623
624
625
626
627
628
629
630
631
632
633
634
635
636
637
638
639
640
641
642
643
644
645
646
647
648
649
650
651
652
653
654
655
656
657
658
659
660
661
662
663
664
665
666
667
668
669
670
671
672
673
674
675
676
677
678
679
680
681
682
683
684
685
686
687
688
689
690
691
692
693
694
695
696
697
698
699
700
701
702
703
704
705
706
707
708
709
710
711
712
713
714
715
716
717
718
719
720
721
722
723
724
725
726
727
728
729
730
731
732
733
734
735
736
737
738
739
740
741
742
743
744
745
746
747
748
749
750
751
752
753
754
755
756
757
758
759
760
761
762
763
764
765
766
767
768
769
770
771
772
773
774
775
776
777
778
779
780
781
782
783
784
785
786
787
788
789
790
791
792
793
794
795
796
797
798
799
800
801
802
803
804
805
806
807
808
809
810
811
812
813
814
815
816
817
818
819
820
821
822
823
824
825
826
827
828
829
830
831
832
833
834
835
836
837
838
839
840
841
842
843
844
845
846
847
848
849
850
851
852
853
854
855
856
857
858
859
860
861
862
863
864
865
866
867
868
869
870
871
872
873
874
875
876
877
878
879
880
881
882
883
884
885
886
887
888
889
890
891
892
893
894
895
896
897
898
899
900
901
902
903
904
905
906
907
908
909
910
911
912
913
914
915
916
917
918
919
920
921
922
923
924
925
926
927
928
929
930
931
932
933
934
935
936
937
938
939
940
941
942
943
944
945
946
947
948
949
950
951
952
953
954
955
956
957
958
959
960
961
962
963
964
965
966
967
968
969
970
971
972
973
974
975
976
977
978
979
980
981
982
983
984
985
986
987
988
989
990
991
992
993
994
995
996
997
998
999
1000
<!DOCTYPE html>
<html lang="en">
<head>
<meta charset="UTF-8">
<meta name="viewport" content="width=device-width, initial-scale=1.0">
<title>Lavian Dsouza - Data Engineering Portfolio</title>
<link rel="icon" type="image/x-icon" href="/static/favicon.ico">
<script src="https://cdn.tailwindcss.com"></script>
<script src="https://unpkg.com/feather-icons"></script>
<script src="https://cdn.jsdelivr.net/npm/feather-icons/dist/feather.min.js"></script>
<script src="https://cdn.jsdelivr.net/npm/vanta@latest/dist/vanta.globe.min.js"></script>
<style>
@import url('https://fonts.googleapis.com/css2?family=Inter:wght@300;400;500;600;700&display=swap');
body {
font-family: 'Inter', sans-serif;
scroll-behavior: smooth;
}
.gradient-bg {
background: linear-gradient(135deg, #667eea 0%, #764ba2 100%);
}
.tech-card {
transition: all 0.3s ease;
border-left: 4px solid transparent;
}
.tech-card:hover {
transform: translateY(-5px);
border-left-color: #667eea;
box-shadow: 0 10px 25px -5px rgba(0, 0, 0, 0.1);
}
.tool-logo {
filter: grayscale(100%);
transition: all 0.3s ease;
}
.tool-logo:hover {
filter: grayscale(0%);
transform: scale(1.05);
}
.floating {
animation: float 6s ease-in-out infinite;
}
@keyframes float {
0% { transform: translateY(0px); }
50% { transform: translateY(-10px); }
100% { transform: translateY(0px); }
}
.vanta-bg {
min-height: 100vh;
width: 100%;
}
.skill-bar {
transition: width 1.5s ease-in-out;
}
.module-grid {
display: grid;
grid-template-columns: repeat(auto-fill, minmax(280px, 1fr));
gap: 1.5rem;
}
.data-flow {
position: relative;
overflow: hidden;
}
.data-flow::before {
content: '';
position: absolute;
top: 0;
left: -100%;
width: 100%;
height: 100%;
background: linear-gradient(90deg, transparent, rgba(255, 255, 255, 0.2), transparent);
animation: flow 3s linear infinite;
}
@keyframes flow {
0% { left: -100%; }
100% { left: 100%; }
}
</style>
</head>
<body class="bg-gray-50 text-gray-800">
<!-- Vanta.js Background -->
<div id="vanta-bg" class="vanta-bg fixed top-0 left-0 w-full h-full z-0"></div>
<!-- Main Content -->
<div class="relative z-10">
<!-- Hero Section -->
<section class="min-h-screen flex items-center justify-center px-4 py-20">
<div class="text-center max-w-4xl mx-auto bg-white/90 backdrop-blur-sm rounded-2xl p-8 md:p-12 shadow-2xl">
<div class="floating mb-8">
<div class="w-32 h-32 mx-auto rounded-full bg-gradient-to-r from-blue-500 to-purple-600 p-1">
<img src="https://media.licdn.com/dms/image/v2/D5603AQHNnZKEtJzXJQ/profile-displayphoto-shrink_200_200/profile-displayphoto-shrink_200_200/0/1722507642476?e=1762992000&v=beta&t=GNONN0FQUJQdgCwJKfsQt0W88wcon_NM008k17GXDmA" alt="Lavian Dsouza" class="w-full h-full rounded-full object-cover">
</div>
</div>
<h1 class="text-4xl md:text-6xl font-bold mb-4 bg-clip-text text-transparent gradient-bg">Lavian Dsouza</h1>
<h2 class="text-xl md:text-2xl text-gray-600 mb-6">Data Engineer & Analytics Specialist</h2>
<p class="text-lg mb-8 max-w-2xl mx-auto">
Master's in Mathematics | Python | SQL | Java | Scala | Data Visualization | Cloud Infrastructure | Machine Learning
</p>
<div class="flex justify-center space-x-4">
<a href="#skills" class="px-6 py-3 bg-blue-600 text-white rounded-lg hover:bg-blue-700 transition">My Skills</a>
<a href="#tools" class="px-6 py-3 border border-blue-600 text-blue-600 rounded-lg hover:bg-blue-50 transition">Tech Stack</a>
<a href="index1.html" class="px-6 py-3 border border-blue-600 text-blue-600 rounded-lg hover:bg-blue-50 transition">Blog</a>
</div>
</div>
</section>
<!-- Professional Summary -->
<section class="py-20 px-4 bg-white/80 backdrop-blur-sm">
<div class="max-w-6xl mx-auto">
<h2 class="text-3xl md:text-4xl font-bold text-center mb-12">Professional Summary</h2>
<div class="tech-card bg-white p-6 rounded-xl shadow-lg">
<p>Data Engineer and Analytics Specialist with over 8 years of experience in designing, building, and maintaining scalable data pipelines, ETL/ELT processes, and real-time data infrastructure for business intelligence, predictive modeling, and machine learning applications. Proficient in Python, SQL, Java, Scala, Spark, Kafka, Airflow, and cloud platforms including Azure, AWS, and GCP to process TB-scale datasets, ensure data quality, and deliver actionable insights. Proven track record of automating reporting, reducing operational inefficiencies by up to 70%, and collaborating with cross-functional teams to align data solutions with organizational goals. Passionate about leveraging mathematical modeling, AI, and big data technologies to solve complex challenges in finance, logistics, and operations. Google-certified in data analytics and IT automation, with a portfolio showcasing end-to-end projects in geospatial analysis, time-series forecasting, and NLP classification.</p>
</div>
</div>
</section>
<!-- Skills Section -->
<section id="skills" class="py-20 px-4 bg-white/80 backdrop-blur-sm">
<div class="max-w-6xl mx-auto">
<h2 class="text-3xl md:text-4xl font-bold text-center mb-12">Key Skills</h2>
<!-- Programming Languages -->
<div class="mb-12">
<h3 class="text-2xl font-semibold mb-6 flex items-center">
<i data-feather="code" class="mr-2"></i> Programming Languages
</h3>
<div class="module-grid">
<div class="data-flow bg-white p-4 rounded-lg shadow-md text-center">
<img src="https://upload.wikimedia.org/wikipedia/commons/thumb/c/c3/Python-logo-notext.svg/1869px-Python-logo-notext.svg.png" alt="Python" class="h-8 mx-auto mb-2">
<h4 class="font-semibold mb-2">Python</h4>
<p class="text-sm text-gray-600">Pandas, NumPy, scikit-learn, TensorFlow, PyTorch, Matplotlib, Plotly, FastAPI, SQLAlchemy, PySpark, Dask, Statsmodels, Featuretools</p>
</div>
<div class="data-flow bg-white p-4 rounded-lg shadow-md text-center">
<img src="https://logodix.com/logo/542135.jpg" alt="SQL" class="h-8 mx-auto mb-2">
<h4 class="font-semibold mb-2">SQL</h4>
<p class="text-sm text-gray-600">Query Optimization, DAX, Power Query, Window Functions, CTEs, Indexing Strategies</p>
</div>
<div class="data-flow bg-white p-4 rounded-lg shadow-md text-center">
<img src="https://upload.wikimedia.org/wikipedia/en/thumb/3/30/Java_programming_language_logo.svg/1200px-Java_programming_language_logo.svg.png" alt="Java" class="h-8 mx-auto mb-2">
<h4 class="font-semibold mb-2">Java</h4>
<p class="text-sm text-gray-600">Object-oriented programming, backend systems, big data integration</p>
</div>
<div class="data-flow bg-white p-4 rounded-lg shadow-md text-center">
<img src="https://pluspng.com/img-png/scala-logo-png-scala-logo-1200x675.png" alt="Scala" class="h-8 mx-auto mb-2">
<h4 class="font-semibold mb-2">Scala</h4>
<p class="text-sm text-gray-600">Functional programming for JVM, Spark pipelines</p>
</div>
<div class="data-flow bg-white p-4 rounded-lg shadow-md text-center">
<img src="https://upload.wikimedia.org/wikipedia/commons/thumb/1/1b/R_logo.svg/310px-R_logo.svg.png" alt="R" class="h-8 mx-auto mb-2">
<h4 class="font-semibold mb-2">R</h4>
<p class="text-sm text-gray-600">tidyverse, ggplot2, lubridate, data.table, caret, forecast</p>
</div>
<div class="data-flow bg-white p-4 rounded-lg shadow-md text-center">
<img src="https://tse4.mm.bing.net/th/id/OIP.XBxYIVoyd0ss6fe1Ui0G6QHaEW?cb=12&rs=1&pid=ImgDetMain&o=7&rm=3" alt="Bash" class="h-8 mx-auto mb-2">
<h4 class="font-semibold mb-2">Bash</h4>
<p class="text-sm text-gray-600">Shell scripting, automation, cron jobs</p>
</div>
<div class="data-flow bg-white p-4 rounded-lg shadow-md text-center">
<img src="https://upload.wikimedia.org/wikipedia/commons/thumb/1/18/C_Programming_Language.svg/1200px-C_Programming_Language.svg.png" alt="C" class="h-8 mx-auto mb-2">
<h4 class="font-semibold mb-2">C</h4>
<p class="text-sm text-gray-600">Systems programming, low-level optimization</p>
</div>
<div class="data-flow bg-white p-4 rounded-lg shadow-md text-center">
<img src="https://upload.wikimedia.org/wikipedia/commons/thumb/1/18/ISO_C%2B%2B_Logo.svg/1200px-ISO_C%2B%2B_Logo.svg.png" alt="C++" class="h-8 mx-auto mb-2">
<h4 class="font-semibold mb-2">C++</h4>
<p class="text-sm text-gray-600">High-performance computing, numerical simulations</p>
</div>
<div class="data-flow bg-white p-4 rounded-lg shadow-md text-center">
<img src="https://upload.wikimedia.org/wikipedia/commons/2/21/Matlab_Logo.png" alt="MATLAB" class="h-8 mx-auto mb-2">
<h4 class="font-semibold mb-2">MATLAB</h4>
<p class="text-sm text-gray-600">Numerical computing environment, algorithm prototyping</p>
</div>
<div class="data-flow bg-white p-4 rounded-lg shadow-md text-center">
<img src="https://upload.wikimedia.org/wikipedia/commons/2/20/Mathematica_Logo.svg" alt="Mathematica" class="h-8 mx-auto mb-2">
<h4 class="font-semibold mb-2">Mathematica</h4>
<p class="text-sm text-gray-600">Computational software, symbolic mathematics</p>
</div>
<div class="data-flow bg-white p-4 rounded-lg shadow-md text-center">
<img src="https://upload.wikimedia.org/wikipedia/commons/6/6a/Gnu-octave-logo.svg" alt="Octave" class="h-8 mx-auto mb-2">
<h4 class="font-semibold mb-2">Octave</h4>
<p class="text-sm text-gray-600">MATLAB-compatible language, numerical simulations</p>
</div>
</div>
</div>
<!-- Compute / Cloud -->
<div class="mb-12">
<h3 class="text-2xl font-semibold mb-6 flex items-center">
<i data-feather="cloud" class="mr-2"></i> Compute / Cloud
</h3>
<div class="module-grid">
<div class="data-flow bg-white p-4 rounded-lg shadow-md text-center">
<img src="https://cdn.freebiesupply.com/logos/large/2x/aws-ec2-logo-svg-vector.svg" alt="AWS EC2" class="h-8 mx-auto mb-2">
<h4 class="font-semibold mb-2">AWS EC2</h4>
<p class="text-sm text-gray-600">Amazon Elastic Compute Cloud (EC2) provides scalable computing capacity in the AWS Cloud. It allows you to run virtual servers, known as instances, to host applications and services. EC2 offers a variety of instance types to meet different computing needs. You can scale resources up or down based on demand, optimizing costs.</p>
<a href="https://docs.aws.amazon.com/ec2/" class="text-blue-600 hover:underline">Documentation</a>
</div>
<div class="data-flow bg-white p-4 rounded-lg shadow-md text-center">
<img src="https://symbols.getvecta.com/stencil_5/3_aws-emr.115b439538.jpg" alt="AWS EMR" class="h-8 mx-auto mb-2">
<h4 class="font-semibold mb-2">AWS EMR</h4>
<p class="text-sm text-gray-600">Amazon Elastic MapReduce (EMR) is a cloud-native big data platform for processing vast amounts of data. It simplifies running frameworks like Apache Hadoop and Apache Spark. EMR automates cluster provisioning, configuration, and tuning. It integrates with other AWS services for data storage and analytics.</p>
<a href="https://docs.aws.amazon.com/emr/" class="text-blue-600 hover:underline">Documentation</a>
</div>
<div class="data-flow bg-white p-4 rounded-lg shadow-md text-center">
<img src="https://download.logo.wine/logo/Google_Compute_Engine/Google_Compute_Engine-Logo.wine.png" alt="GCP Compute Engine" class="h-8 mx-auto mb-2">
<h4 class="font-semibold mb-2">GCP Compute Engine</h4>
<p class="text-sm text-gray-600">Google Cloud Compute Engine provides scalable virtual machines in the Google Cloud. It offers various machine types to suit different workloads. Compute Engine integrates with other GCP services for networking and storage. It supports custom machine types and preemptible VMs for cost savings.</p>
<a href="https://cloud.google.com/compute/docs" class="text-blue-600 hover:underline">Documentation</a>
</div>
<div class="data-flow bg-white p-4 rounded-lg shadow-md text-center">
<img src="https://cloud.google.com/static/dataproc/images/dataproc-logo.png" alt="GCP Dataproc" class="h-8 mx-auto mb-2">
<h4 class="font-semibold mb-2">GCP Dataproc</h4>
<p class="text-sm text-gray-600">Google Cloud Dataproc is a fast, fully managed cloud service for running Apache Spark and Hadoop clusters. It simplifies the setup and management of big data frameworks. Dataproc integrates with other GCP services like BigQuery and Cloud Storage. It supports automation and scaling to optimize resource usage.</p>
<a href="https://cloud.google.com/dataproc/docs" class="text-blue-600 hover:underline">Documentation</a>
</div>
<div class="data-flow bg-white p-4 rounded-lg shadow-md text-center">
<img src="https://symbols.getvecta.com/stencil_27/102_vm-symbol.c0581746e3.png" alt="Azure VM" class="h-8 mx-auto mb-2">
<h4 class="font-semibold mb-2">Azure VM</h4>
<p class="text-sm text-gray-600">Azure Virtual Machines provide on-demand scalable computing resources in the Microsoft Azure cloud. They support various operating systems, including Linux and Windows. VMs can be configured with different sizes and performance levels. Azure offers tools for managing and automating VM deployments.</p>
<a href="https://learn.microsoft.com/en-us/azure/virtual-machines/" class="text-blue-600 hover:underline">Documentation</a>
</div>
<div class="data-flow bg-white p-4 rounded-lg shadow-md text-center">
<img src="https://pngimg.com/d/linux_PNG29.png" alt="On-Prem Linux Servers" class="h-8 mx-auto mb-2">
<h4 class="font-semibold mb-2">On-Prem Linux Servers</h4>
<p class="text-sm text-gray-600">On-premises Linux servers are physical machines running Linux operating systems within an organization's data center. They offer full control over hardware and software configurations. Linux servers are commonly used for hosting applications and services. They can be integrated with cloud services for hybrid cloud architectures.</p>
<a href="https://tldp.org/guides.html" class="text-blue-600 hover:underline">Documentation</a>
</div>
<div class="data-flow bg-white p-4 rounded-lg shadow-md text-center">
<img src="https://www.citypng.com/public/uploads/preview/hd-microsoft-windows-server-logo-png-701751694708203flnnifkrrb.png" alt="On-Prem Windows Servers" class="h-8 mx-auto mb-2">
<h4 class="font-semibold mb-2">On-Prem Windows Servers</h4>
<p class="text-sm text-gray-600">On-premises Windows servers are physical machines running Microsoft Windows Server operating systems within an organization's data center. They provide enterprise-grade features for hosting applications and services. Windows servers support integration with Active Directory and other Microsoft services. They can be managed using tools like Windows Admin Center and PowerShell.</p>
<a href="https://learn.microsoft.com/en-us/windows-server/" class="text-blue-600 hover:underline">Documentation</a>
</div>
<div class="data-flow bg-white p-4 rounded-lg shadow-md text-center">
<img src="https://upload.wikimedia.org/wikipedia/commons/thumb/3/39/Kubernetes_logo_without_workmark.svg/2109px-Kubernetes_logo_without_workmark.svg.png" alt="Kubernetes" class="h-8 mx-auto mb-2">
<h4 class="font-semibold mb-2">Kubernetes</h4>
<p class="text-sm text-gray-600">Kubernetes is an open-source platform for automating the deployment, scaling, and operation of application containers. It manages clusters of containers and ensures high availability. Kubernetes supports various container runtimes and integrates with cloud services. It provides features like load balancing, storage orchestration, and self-healing.</p>
<a href="https://kubernetes.io/docs/home/" class="text-blue-600 hover:underline">Documentation</a>
</div>
<div class="data-flow bg-white p-4 rounded-lg shadow-md text-center">
<img src="https://assets.stickpng.com/images/62a9c7c08ff6441a2952dad3.png" alt="Docker" class="h-8 mx-auto mb-2">
<h4 class="font-semibold mb-2">Docker</h4>
<p class="text-sm text-gray-600">Docker is a platform for developing, shipping, and running applications in containers. It allows developers to package applications with all dependencies into a standardized unit. Docker simplifies application deployment and scaling across environments. It integrates with various orchestration tools like Kubernetes and Docker Swarm.</p>
<a href="https://docs.docker.com/" class="text-blue-600 hover:underline">Documentation</a>
</div>
<div class="data-flow bg-white p-4 rounded-lg shadow-md text-center">
<img src="https://upload.wikimedia.org/wikipedia/commons/thumb/3/3a/OpenShift-LogoType.svg/719px-OpenShift-LogoType.svg.png" alt="OpenShift" class="h-8 mx-auto mb-2">
<h4 class="font-semibold mb-2">OpenShift</h4>
<p class="text-sm text-gray-600">OpenShift is an open-source container application platform based on Kubernetes. It provides developer and operational tools for building, deploying, and managing applications. OpenShift offers features like automated builds, scaling, and monitoring. It integrates with various CI/CD pipelines and cloud services.</p>
<a href="https://docs.redhat.com/" class="text-blue-600 hover:underline">Documentation</a>
</div>
<div class="data-flow bg-white p-4 rounded-lg shadow-md text-center">
<img src="https://1000logos.net/wp-content/uploads/2024/08/Oracle-Cloud-Logo.jpg" alt="Oracle Cloud Infrastructure (OCI)" class="h-8 mx-auto mb-2">
<h4 class="font-semibold mb-2">Oracle Cloud Infrastructure (OCI)</h4>
<p class="text-sm text-gray-600">Oracle Cloud Infrastructure (OCI) is a comprehensive suite of cloud services offering compute, storage, networking, and database capabilities. OCI is designed to run enterprise applications with high performance and security. It provides a unified platform for building, deploying, and managing applications in the cloud. OCI supports hybrid and multicloud architectures, integrating with other cloud providers.</p>
<a href="https://docs.oracle.com/iaas/" class="text-blue-600 hover:underline">Documentation</a>
</div>
<div class="data-flow bg-white p-4 rounded-lg shadow-md text-center">
<img src="https://www.ibm.com/cloud/static/images/ibm-cloud-logo.svg" alt="IBM Cloud" class="h-8 mx-auto mb-2">
<h4 class="font-semibold mb-2">IBM Cloud</h4>
<p class="text-sm text-gray-600">IBM Cloud offers a range of cloud computing services, including IaaS, PaaS, and SaaS. It provides tools for AI, data analytics, and enterprise application development. IBM Cloud supports hybrid and multicloud environments with strong security features. It integrates with IBM's legacy systems and modern cloud-native applications.</p>
<a href="https://cloud.ibm.com/docs" class="text-blue-600 hover:underline">Documentation</a>
</div>
<div class="data-flow bg-white p-4 rounded-lg shadow-md text-center">
<img src="https://upload.wikimedia.org/wikipedia/commons/f/f1/Alibaba_Cloud_Logo.png" alt="Alibaba Cloud" class="h-8 mx-auto mb-2">
<h4 class="font-semibold mb-2">Alibaba Cloud</h4>
<p class="text-sm text-gray-600">Alibaba Cloud is a leading cloud services provider in Asia, offering a wide range of cloud computing solutions. It provides services such as Elastic Compute Service (ECS), Object Storage Service (OSS), and ApsaraDB. Alibaba Cloud supports global deployments with data centers in multiple regions. It is known for its strong presence in the Asia-Pacific market.</p>
<a href="https://www.alibabacloud.com/help/en" class="text-blue-600 hover:underline">Documentation</a>
</div>
<div class="data-flow bg-white p-4 rounded-lg shadow-md text-center">
<img src="https://upload.wikimedia.org/wikipedia/commons/f/ff/DigitalOcean_logo.svg" alt="DigitalOcean" class="h-8 mx-auto mb-2">
<h4 class="font-semibold mb-2">DigitalOcean</h4>
<p class="text-sm text-gray-600">DigitalOcean is a cloud infrastructure provider focused on simplicity and scalability for developers. It offers Droplets (virtual private servers), managed databases, and Kubernetes clusters. DigitalOcean is popular among startups and small businesses for its ease of use. It provides transparent pricing and a user-friendly interface.</p>
<a href="https://docs.digitalocean.com/" class="text-blue-600 hover:underline">Documentation</a>
</div>
<div class="data-flow bg-white p-4 rounded-lg shadow-md text-center">
<img src="https://upload.wikimedia.org/wikipedia/commons/b/bc/Linode_updated_logo.png" alt="Linode" class="h-8 mx-auto mb-2">
<h4 class="font-semibold mb-2">Linode</h4>
<p class="text-sm text-gray-600">Linode is a cloud hosting provider offering virtual private servers and cloud computing services. It provides a range of plans suitable for developers, startups, and small businesses. Linode is known for its straightforward pricing and reliable performance. It offers a variety of tools for managing and deploying applications.</p>
<a href="https://www.linode.com/docs/" class="text-blue-600 hover:underline">Documentation</a>
</div>
<div class="data-flow bg-white p-4 rounded-lg shadow-md text-center">
<img src="https://logo.svgcdn.com/l/vultr-8x.png" alt="Vultr" class="h-8 mx-auto mb-2">
<h4 class="font-semibold mb-2">Vultr</h4>
<p class="text-sm text-gray-600">Vultr is a cloud infrastructure provider offering compute instances and block storage. It provides a global network of data centers for low-latency deployments. Vultr is known for its competitive pricing and simplicity. It caters to developers and businesses seeking cost-effective cloud solutions.</p>
<a href="https://docs.vultr.com/" class="text-blue-600 hover:underline">Documentation</a>
</div>
<div class="data-flow bg-white p-4 rounded-lg shadow-md text-center">
<img src="https://cdn.freebiesupply.com/images/large/2x/vmware-logo-png.png" alt="VMware vSphere / vCloud" class="h-8 mx-auto mb-2">
<h4 class="font-semibold mb-2">VMware vSphere / vCloud</h4>
<p class="text-sm text-gray-600">VMware vSphere is a virtualization platform for enterprise data centers. It provides virtual machines, resource management, and high availability. vCloud extends VMware virtualization to private and hybrid cloud. It integrates with enterprise applications and storage systems.</p>
<a href="https://techdocs.broadcom.com/us/en/vmware-cis/vsphere/vsphere/8-0.html" class="text-blue-600 hover:underline">Documentation</a>
</div>
</div>
</div>
<!-- Operating System -->
<div class="mb-12">
<h3 class="text-2xl font-semibold mb-6 flex items-center">
<i data-feather="hard-drive" class="mr-2"></i> Operating System
</h3>
<div class="module-grid">
<div class="data-flow bg-white p-4 rounded-lg shadow-md text-center">
<img src="https://upload.wikimedia.org/wikipedia/commons/thumb/9/9e/UbuntuCoF.svg/1024px-UbuntuCoF.svg.png" alt="Ubuntu" class="h-8 mx-auto mb-2">
<h4 class="font-semibold mb-2">Ubuntu</h4>
<p class="text-sm text-gray-600">Ubuntu is a popular Linux distribution known for its ease of use and community support. It is widely used for server deployments and cloud environments. Ubuntu offers regular updates and long-term support releases. It has a large repository of software packages and tools.</p>
<a href="https://help.ubuntu.com/" class="text-blue-600 hover:underline">Documentation</a>
</div>
<div class="data-flow bg-white p-4 rounded-lg shadow-md text-center">
<img src="https://1000logos.net/wp-content/uploads/2021/04/Red-Hat-logo.png" alt="Red Hat Enterprise Linux" class="h-8 mx-auto mb-2">
<h4 class="font-semibold mb-2">Red Hat Enterprise Linux</h4>
<p class="text-sm text-gray-600">Red Hat Enterprise Linux (RHEL) is an enterprise-grade Linux distribution. It provides a stable and secure platform for running applications and services. RHEL offers long-term support and certified hardware compatibility. It includes tools for system management and automation.</p>
<a href="https://docs.redhat.com/" class="text-blue-600 hover:underline">Documentation</a>
</div>
<div class="data-flow bg-white p-4 rounded-lg shadow-md text-center">
<img src="https://wiki.centos.org/attachments/ArtWork(2f)Brand(2f)Logo/centos-logo-light.png" alt="CentOS Stream" class="h-8 mx-auto mb-2">
<h4 class="font-semibold mb-2">CentOS Stream</h4>
<p class="text-sm text-gray-600">CentOS Stream is a free and open-source Linux distribution that serves as a rolling-release upstream for RHEL. It is widely used for server deployments and development environments. CentOS Stream offers a community-driven approach with continuous updates. It provides a stable and secure platform for various applications (note: Original CentOS Linux EOL in 2021; migrate to Stream or alternatives like Rocky Linux/AlmaLinux for stability).</p>
<a href="https://docs.centos.org/" class="text-blue-600 hover:underline">Documentation</a>
</div>
<div class="data-flow bg-white p-4 rounded-lg shadow-md text-center">
<img src="https://images.seeklogo.com/logo-png/39/1/suse-logo-png_seeklogo-395496.png" alt="SUSE" class="h-8 mx-auto mb-2">
<h4 class="font-semibold mb-2">SUSE</h4>
<p class="text-sm text-gray-600">SUSE Linux Enterprise Server is an enterprise-class Linux server operating system. It offers a reliable and secure platform for running applications and services. SUSE provides tools for system management and automation. It supports various hardware architectures and cloud platforms.</p>
<a href="https://documentation.suse.com/" class="text-blue-600 hover:underline">Documentation</a>
</div>
<div class="data-flow bg-white p-4 rounded-lg shadow-md text-center">
<img src="https://www.citypng.com/public/uploads/preview/hd-microsoft-windows-server-logo-png-701751694708203flnnifkrrb.png" alt="Windows Server" class="h-8 mx-auto mb-2">
<h4 class="font-semibold mb-2">Windows Server</h4>
<p class="text-sm text-gray-600">Windows Server is a server operating system developed by Microsoft. It provides a platform for hosting applications and services. Windows Server includes features for virtualization, networking, and security. It integrates with other Microsoft services and tools.</p>
<a href="https://learn.microsoft.com/en-us/windows-server/" class="text-blue-600 hover:underline">Documentation</a>
</div>
</div>
</div>
<!-- Storage -->
<div class="mb-12">
<h3 class="text-2xl font-semibold mb-6 flex items-center">
<i data-feather="database" class="mr-2"></i> Storage
</h3>
<div class="module-grid">
<div class="data-flow bg-white p-4 rounded-lg shadow-md text-center">
<img src="https://upload.wikimedia.org/wikipedia/commons/b/bc/Amazon-S3-Logo.svg" alt="Amazon S3" class="h-8 mx-auto mb-2">
<h4 class="font-semibold mb-2">Amazon S3</h4>
<p class="text-sm text-gray-600">Amazon Simple Storage Service (S3) offers scalable object storage for data backup, archival, and analytics. It provides high durability and availability for stored objects. S3 supports a range of storage classes to optimize costs. It integrates with various AWS services for data processing and analysis.</p>
<a href="https://docs.aws.amazon.com/s3/" class="text-blue-600 hover:underline">Documentation</a>
</div>
<div class="data-flow bg-white p-4 rounded-lg shadow-md text-center">
<img src="https://upload.wikimedia.org/wikipedia/commons/0/0e/Hadoop_logo.svg" alt="HDFS" class="h-8 mx-auto mb-2">
<h4 class="font-semibold mb-2">HDFS</h4>
<p class="text-sm text-gray-600">Hadoop Distributed File System (HDFS) is designed for storing very large files across multiple machines. It provides high throughput access to application data. HDFS is fault-tolerant and automatically replicates data blocks across nodes. It is the backbone of Hadoop’s storage layer for big data processing.</p>
<a href="https://hadoop.apache.org/docs/r1.2.1/hdfs_design.html" class="text-blue-600 hover:underline">Documentation</a>
</div>
<div class="data-flow bg-white p-4 rounded-lg shadow-md text-center">
<img src="https://p7.hiclipart.com/preview/880/426/430/azure-data-lake-microsoft-azure-sql-database-big-data-data-lake-thumbnail.jpg" alt="Azure Data Lake" class="h-8 mx-auto mb-2">
<h4 class="font-semibold mb-2">Azure Data Lake</h4>
<p class="text-sm text-gray-600">Azure Data Lake Storage is a scalable data storage service for big data analytics. It supports hierarchical namespace and fine-grained access control. Data Lake integrates with Azure analytics services like Synapse and Databricks. It offers high throughput and low latency for data operations.</p>
<a href="https://learn.microsoft.com/en-us/azure/storage/blobs/data-lake-storage-introduction" class="text-blue-600 hover:underline">Documentation</a>
</div>
<div class="data-flow bg-white p-4 rounded-lg shadow-md text-center">
<img src="https://www.citypng.com/public/uploads/preview/hd-google-cloud-storage-logo-png-701751694778212ny7ddsjx3z.png" alt="Google Cloud Storage" class="h-8 mx-auto mb-2">
<h4 class="font-semibold mb-2">Google Cloud Storage</h4>
<p class="text-sm text-gray-600">Google Cloud Storage is a scalable object storage service for data archiving, backup, and analytics. It provides high durability and low-latency access to data. Cloud Storage supports various storage classes for cost optimization. It integrates with GCP services like BigQuery and Dataflow.</p>
<a href="https://cloud.google.com/storage/docs" class="text-blue-600 hover:underline">Documentation</a>
</div>
<div class="data-flow bg-white p-4 rounded-lg shadow-md text-center">
<img src="https://download.logo.wine/logo/Apache_HBase/Apache_HBase-Logo.wine.png" alt="HBase" class="h-8 mx-auto mb-2">
<h4 class="font-semibold mb-2">HBase</h4>
<p class="text-sm text-gray-600">Apache HBase is a distributed, scalable, big data store modeled after Google Bigtable. It allows random, real-time read/write access to large datasets. HBase integrates seamlessly with Hadoop and HDFS for storage. It is used for applications requiring fast access to sparse datasets.</p>
<a href="https://hbase.apache.org/book.html" class="text-blue-600 hover:underline">Documentation</a>
</div>
<div class="data-flow bg-white p-4 rounded-lg shadow-md text-center">
<img src="https://upload.wikimedia.org/wikipedia/commons/1/1e/Apache-cassandra-icon.png" alt="Cassandra" class="h-8 mx-auto mb-2">
<h4 class="font-semibold mb-2">Cassandra</h4>
<p class="text-sm text-gray-600">Apache Cassandra is a NoSQL distributed database for handling large amounts of data across many servers. It offers high availability with no single point of failure. Cassandra supports a flexible schema and tunable consistency. It is ideal for applications needing high write throughput.</p>
<a href="https://cassandra.apache.org/doc/latest/" class="text-blue-600 hover:underline">Documentation</a>
</div>
<div class="data-flow bg-white p-4 rounded-lg shadow-md text-center">
<img src="https://upload.wikimedia.org/wikipedia/commons/thumb/9/93/MongoDB_Logo.svg/2560px-MongoDB_Logo.svg.png" alt="MongoDB" class="h-8 mx-auto mb-2">
<h4 class="font-semibold mb-2">MongoDB</h4>
<p class="text-sm text-gray-600">MongoDB is a NoSQL document-oriented database. It stores data in flexible JSON-like documents with dynamic schemas. MongoDB supports high availability and horizontal scaling. It is widely used in modern web and mobile applications.</p>
<a href="https://www.mongodb.com/docs/" class="text-blue-600 hover:underline">Documentation</a>
</div>
<div class="data-flow bg-white p-4 rounded-lg shadow-md text-center">
<img src="https://upload.wikimedia.org/wikipedia/commons/6/67/Couchbase%2C_Inc._official_logo.png" alt="Couchbase" class="h-8 mx-auto mb-2">
<h4 class="font-semibold mb-2">Couchbase</h4>
<p class="text-sm text-gray-600">Couchbase is a distributed NoSQL document database. It offers high performance, scalability, and flexible data models. Couchbase supports key-value access, full-text search, and analytics. It is commonly used in interactive web and mobile applications.</p>
<a href="https://docs.couchbase.com/home/index.html" class="text-blue-600 hover:underline">Documentation</a>
</div>
<div class="data-flow bg-white p-4 rounded-lg shadow-md text-center">
<img src="https://upload.wikimedia.org/wikipedia/commons/f/fd/DynamoDB.png" alt="DynamoDB" class="h-8 mx-auto mb-2">
<h4 class="font-semibold mb-2">DynamoDB</h4>
<p class="text-sm text-gray-600">Amazon DynamoDB is a fully managed NoSQL database service. It provides fast, predictable performance and seamless scalability. DynamoDB integrates with AWS services like Lambda and EMR. It is used for applications that require low-latency data access.</p>
<a href="https://docs.aws.amazon.com/dynamodb/" class="text-blue-600 hover:underline">Documentation</a>
</div>
<div class="data-flow bg-white p-4 rounded-lg shadow-md text-center">
<img src="https://p1.hiclipart.com/preview/663/349/198/graphy-logo-kudu-antelope-horn-antler-deer-wildlife-cowgoat-family-png-clipart.jpg" alt="Kudu" class="h-8 mx-auto mb-2">
<h4 class="font-semibold mb-2">Kudu</h4>
<p class="text-sm text-gray-600">Apache Kudu is a columnar storage manager for fast analytics on fast data. It provides a combination of fast inserts/updates and efficient columnar scans. Kudu integrates with Impala, Spark, and Hadoop ecosystem. It is designed for real-time analytics on mutable data.</p>
<a href="https://kudu.apache.org/docs/" class="text-blue-600 hover:underline">Documentation</a>
</div>
<div class="data-flow bg-white p-4 rounded-lg shadow-md text-center">
<img src="https://upload.wikimedia.org/wikipedia/commons/a/aa/DL-stacked-RGB-200px.png" alt="Delta Lake" class="h-8 mx-auto mb-2">
<h4 class="font-semibold mb-2">Delta Lake</h4>
<p class="text-sm text-gray-600">Delta Lake is an open-source storage layer that adds ACID transactions to Spark and big data workloads. It allows scalable, reliable data lakes. Delta Lake supports schema evolution and time travel queries. It integrates with Spark, Databricks, and cloud storage.</p>
<a href="https://delta-docs-incubator.netlify.app/" class="text-blue-600 hover:underline">Documentation</a>
</div>
<div class="data-flow bg-white p-4 rounded-lg shadow-md text-center">
<img src="https://p7.hiclipart.com/preview/637/556/885/iceberg-logo-clip-art-iceberg.jpg" alt="Iceberg" class="h-8 mx-auto mb-2">
<h4 class="font-semibold mb-2">Iceberg</h4>
<p class="text-sm text-gray-600">Apache Iceberg is an open table format for huge analytic datasets. It supports ACID transactions and schema evolution. Iceberg is optimized for high-performance reads and writes. It integrates with Spark, Flink, Trino, and Hive.</p>
<a href="https://iceberg.apache.org/docs/nightly/" class="text-blue-600 hover:underline">Documentation</a>
</div>
<div class="data-flow bg-white p-4 rounded-lg shadow-md text-center">
<img src="https://www.citypng.com/public/uploads/preview/hd-mysql-logo-transparent-background-701751694771788209ydqoapx.png" alt="MySQL" class="h-8 mx-auto mb-2">
<h4 class="font-semibold mb-2">MySQL</h4>
<p class="text-sm text-gray-600">MySQL is an open-source relational database management system. It supports SQL queries and transactions for structured data. MySQL is widely used for web applications and data warehousing. It integrates with various programming languages and tools.</p>
<a href="https://dev.mysql.com/doc/" class="text-blue-600 hover:underline">Documentation</a>
</div>
<div class="data-flow bg-white p-4 rounded-lg shadow-md text-center">
<img src="https://upload.wikimedia.org/wikipedia/commons/a/ad/Logo_PostgreSQL.png" alt="PostgreSQL" class="h-8 mx-auto mb-2">
<h4 class="font-semibold mb-2">PostgreSQL</h4>
<p class="text-sm text-gray-600">PostgreSQL is an advanced open-source relational database. It supports complex queries, indexing, and extensibility. PostgreSQL is known for its standards compliance and robustness. It integrates with analytics and GIS tools.</p>
<a href="https://www.postgresql.org/docs/" class="text-blue-600 hover:underline">Documentation</a>
</div>
<div class="data-flow bg-white p-4 rounded-lg shadow-md text-center">
<img src="https://upload.wikimedia.org/wikipedia/commons/thumb/5/50/Oracle_logo.svg/2560px-Oracle_logo.svg.png" alt="Oracle" class="h-8 mx-auto mb-2">
<h4 class="font-semibold mb-2">Oracle</h4>
<p class="text-sm text-gray-600">Oracle Database is a multi-model database management system. It supports SQL and NoSQL data models for enterprise applications. Oracle offers high availability, security, and scalability features. It integrates with cloud and on-prem environments.</p>
<a href="https://docs.oracle.com/en/database/" class="text-blue-600 hover:underline">Documentation</a>
</div>
<div class="data-flow bg-white p-4 rounded-lg shadow-md text-center">
<img src="https://cdn.freebiesupply.com/logos/large/2x/microsoft-sql-server-logo-svg-vector.svg" alt="SQL Server" class="h-8 mx-auto mb-2">
<h4 class="font-semibold mb-2">SQL Server</h4>
<p class="text-sm text-gray-600">Microsoft SQL Server is a relational database management system. It supports transaction processing, business intelligence, and analytics. SQL Server offers integration with Azure and on-prem tools. It provides security and performance optimization features.</p>
<a href="https://learn.microsoft.com/en-us/sql/sql-server/" class="text-blue-600 hover:underline">Documentation</a>
</div>
<div class="data-flow bg-white p-4 rounded-lg shadow-md text-center">
<img src="https://mariadb.com/wp-content/uploads/2019/11/mariadb-logo-vert_blue-transparent.png" alt="MariaDB" class="h-8 mx-auto mb-2">
<h4 class="font-semibold mb-2">MariaDB</h4>
<p class="text-sm text-gray-600">MariaDB is a community-developed fork of MySQL. It offers drop-in compatibility with MySQL and additional features. MariaDB supports high-performance querying and replication. It is used for web applications and data analytics.</p>
<a href="https://mariadb.com/kb/en/documentation/" class="text-blue-600 hover:underline">Documentation</a>
</div>
<div class="data-flow bg-white p-4 rounded-lg shadow-md text-center">
<img src="https://upload.wikimedia.org/wikipedia/commons/a/a5/Neo4j-logo2024color.png" alt="Neo4j" class="h-8 mx-auto mb-2">
<h4 class="font-semibold mb-2">Neo4j</h4>
<p class="text-sm text-gray-600">Neo4j is a graph database optimized for graph analytics. It allows modeling relationships and performing graph queries efficiently. Neo4j integrates with analytics and ML tools for recommendation and network analysis. It supports Cypher query language and high-performance graph algorithms.</p>
<a href="https://neo4j.com/docs/" class="text-blue-600 hover:underline">Documentation</a>
</div>
<div class="data-flow bg-white p-4 rounded-lg shadow-md text-center">
<img src="https://www.pngfind.com/pngs/m/415-4155085_tigergraph-logo-hd-png-download.png" alt="TigerGraph" class="h-8 mx-auto mb-2">
<h4 class="font-semibold mb-2">TigerGraph</h4>
<p class="text-sm text-gray-600">TigerGraph is a native parallel graph database for large-scale graph analytics. It supports real-time graph queries and analytics. TigerGraph provides GSQL for querying and algorithm execution. It integrates with AI, BI, and ETL pipelines for graph processing.</p>
<a href="https://docs.tigergraph.com/home/" class="text-blue-600 hover:underline">Documentation</a>
</div>
</div>
</div>
<!-- SQL / Data Warehouse -->
<div class="mb-12">
<h3 class="text-2xl font-semibold mb-6 flex items-center">
<i data-feather="database" class="mr-2"></i> SQL / Data Warehouse
</h3>
<div class="module-grid">
<div class="data-flow bg-white p-4 rounded-lg shadow-md text-center">
<img src="https://upload.wikimedia.org/wikipedia/commons/b/bb/Apache_Hive_logo.svg" alt="Hive" class="h-8 mx-auto mb-2">
<h4 class="font-semibold mb-2">Hive</h4>
<p class="text-sm text-gray-600">Apache Hive is a data warehouse software built on top of Hadoop. It provides SQL-like querying capabilities for large datasets. Hive translates queries into MapReduce, Tez, or Spark jobs. It is widely used for batch processing in Hadoop ecosystems.</p>
<a href="https://hive.apache.org/" class="text-blue-600 hover:underline">Documentation</a>
</div>
<div class="data-flow bg-white p-4 rounded-lg shadow-md text-center">
<img src="https://assets.stickpng.com/images/62bc4958071dec17849af2e4.png" alt="Presto" class="h-8 mx-auto mb-2">
<h4 class="font-semibold mb-2">Presto</h4>
<p class="text-sm text-gray-600">Presto is a distributed SQL query engine designed for big data analytics. It allows querying data across multiple sources including Hive, Cassandra, and relational databases. Presto provides fast, interactive query performance. It is used for large-scale data exploration and analytics.</p>
<a href="https://prestodb.io/docs/current/" class="text-blue-600 hover:underline">Documentation</a>
</div>
<div class="data-flow bg-white p-4 rounded-lg shadow-md text-center">
<img src="https://upload.wikimedia.org/wikipedia/commons/5/57/Trino-logo-w-bk.svg" alt="Trino" class="h-8 mx-auto mb-2">
<h4 class="font-semibold mb-2">Trino</h4>
<p class="text-sm text-gray-600">Trino is a distributed SQL query engine for analytics on large datasets. It supports querying multiple data sources like Hive, Cassandra, and MySQL. Trino offers low-latency, high-performance querying. It is widely used in enterprises for interactive analytics.</p>
<a href="https://trino.io/docs/current/index.html" class="text-blue-600 hover:underline">Documentation</a>
</div>
<div class="data-flow bg-white p-4 rounded-lg shadow-md text-center">
<img src="https://cdn.freebiesupply.com/logos/large/2x/chevy-impala-logo-png-transparent.png" alt="Impala" class="h-8 mx-auto mb-2">
<h4 class="font-semibold mb-2">Impala</h4>
<p class="text-sm text-gray-600">Apache Impala is an open-source MPP SQL query engine for Hadoop. It enables low-latency and high-performance queries on HDFS and HBase. Impala integrates with Hive Metastore and Hadoop ecosystem. It is ideal for real-time analytics on large datasets.</p>
<a href="https://impala.apache.org/impala-docs.html" class="text-blue-600 hover:underline">Documentation</a>
</div>
<div class="data-flow bg-white p-4 rounded-lg shadow-md text-center">
<img src="https://c8.alamy.com/comp/R1RWX0/drill-vector-icon-isolated-on-transparent-background-drill-transparency-logo-concept-R1RWX0.jpg" alt="Drill" class="h-8 mx-auto mb-2">
<h4 class="font-semibold mb-2">Drill</h4>
<p class="text-sm text-gray-600">Apache Drill is a SQL query engine for large-scale datasets. It allows schema-free queries on structured and semi-structured data. Drill integrates with HDFS, NoSQL, and cloud storage. It is used for exploratory and interactive analytics.</p>
<a href="https://drill.apache.org/docs/" class="text-blue-600 hover:underline">Documentation</a>
</div>
<div class="data-flow bg-white p-4 rounded-lg shadow-md text-center">
<img src="https://www.vhv.rs/dpng/d/410-4104998_the-birth-of-the-phoenix-png-download-transparent.png" alt="Phoenix" class="h-8 mx-auto mb-2">
<h4 class="font-semibold mb-2">Phoenix</h4>
<p class="text-sm text-gray-600">Apache Phoenix provides a SQL layer over HBase for OLTP and analytics. It allows low-latency queries on HBase tables using SQL syntax. Phoenix supports secondary indexing, joins, and transactions. It integrates with Hadoop ecosystem tools for big data workflows.</p>
<a href="https://phoenix.apache.org/" class="text-blue-600 hover:underline">Documentation</a>
</div>
<div class="data-flow bg-white p-4 rounded-lg shadow-md text-center">
<img src="https://upload.wikimedia.org/wikipedia/commons/thumb/f/ff/Snowflake_Logo.svg/2560px-Snowflake_Logo.svg.png" alt="Snowflake" class="h-8 mx-auto mb-2">
<h4 class="font-semibold mb-2">Snowflake</h4>
<p class="text-sm text-gray-600">Snowflake is a cloud-based data warehouse platform. It offers scalability, concurrency, and performance optimization. Snowflake separates storage and compute for flexible resource usage. It integrates with various ETL and BI tools.</p>
<a href="https://docs.snowflake.com/" class="text-blue-600 hover:underline">Documentation</a>
</div>
<div class="data-flow bg-white p-4 rounded-lg shadow-md text-center">
<img src="https://upload.wikimedia.org/wikipedia/commons/thumb/7/73/Amazon-Redshift-Logo.svg/1862px-Amazon-Redshift-Logo.svg.png" alt="Redshift" class="h-8 mx-auto mb-2">
<h4 class="font-semibold mb-2">Redshift</h4>
<p class="text-sm text-gray-600">Amazon Redshift is a fully managed cloud data warehouse. It allows running complex analytical queries on structured data. Redshift offers columnar storage, parallel processing, and compression. It integrates with AWS ecosystem for ETL and analytics workflows.</p>
<a href="https://docs.aws.amazon.com/redshift/" class="text-blue-600 hover:underline">Documentation</a>
</div>
<div class="data-flow bg-white p-4 rounded-lg shadow-md text-center">
<img src="https://1000logos.net/wp-content/uploads/2024/10/BigQuery-Logo.jpg" alt="BigQuery" class="h-8 mx-auto mb-2">
<h4 class="font-semibold mb-2">BigQuery</h4>
<p class="text-sm text-gray-600">Google BigQuery is a fully managed, serverless data warehouse. It supports fast SQL queries over large datasets. BigQuery integrates with GCP analytics, ML, and visualization tools. It allows real-time analytics and machine learning on stored data.</p>
<a href="https://cloud.google.com/bigquery/docs" class="text-blue-600 hover:underline">Documentation</a>
</div>
<div class="data-flow bg-white p-4 rounded-lg shadow-md text-center">
<img src="https://images.seeklogo.com/logo-png/43/1/azure-synapse-analytics-logo-png_seeklogo-434054.png" alt="Synapse" class="h-8 mx-auto mb-2">
<h4 class="font-semibold mb-2">Synapse</h4>
<p class="text-sm text-gray-600">Azure Synapse Analytics combines big data and data warehousing. It supports both serverless and provisioned resource models. Synapse integrates with Azure data services for pipelines and analytics. It provides query performance optimization and monitoring tools.</p>
<a href="https://learn.microsoft.com/en-us/azure/synapse-analytics/" class="text-blue-600 hover:underline">Documentation</a>
</div>
<div class="data-flow bg-white p-4 rounded-lg shadow-md text-center">
<img src="https://upload.wikimedia.org/wikipedia/commons/0/0c/Teradata_logo_2018.png" alt="Teradata" class="h-8 mx-auto mb-2">
<h4 class="font-semibold mb-2">Teradata</h4>
<p class="text-sm text-gray-600">Teradata is a scalable relational database for large-scale analytics. It supports complex SQL queries and parallel processing. Teradata is used for enterprise data warehousing and BI solutions. It integrates with ETL and analytics platforms for decision support.</p>
<a href="https://docs.teradata.com/" class="text-blue-600 hover:underline">Documentation</a>
</div>
<div class="data-flow bg-white p-4 rounded-lg shadow-md text-center">
<img src="https://upload.wikimedia.org/wikipedia/commons/0/0f/Greenplum_Logo.jpg" alt="Greenplum" class="h-8 mx-auto mb-2">
<h4 class="font-semibold mb-2">Greenplum</h4>
<p class="text-sm text-gray-600">Greenplum Database is a massively parallel processing data warehouse based on PostgreSQL. It supports large-scale analytics and data processing. Greenplum provides advanced analytics functions and extensibility. It integrates with big data tools and ETL pipelines.</p>
<a href="https://techdocs.broadcom.com/us/en/vmware-tanzu/data-solutions/tanzu-greenplum/7/greenplum-database/landing-index.html" class="text-blue-600 hover:underline">Documentation</a>
</div>
<div class="data-flow bg-white p-4 rounded-lg shadow-md text-center">
<img src="https://upload.wikimedia.org/wikipedia/commons/0/0e/Clickhouse.png" alt="ClickHouse" class="h-8 mx-auto mb-2">
<h4 class="font-semibold mb-2">ClickHouse</h4>
<p class="text-sm text-gray-600">ClickHouse is a columnar OLAP database for real-time analytics. It supports high-throughput ingestion and query performance. ClickHouse is used for event analytics, monitoring, and BI. It integrates with Spark, Kafka, and visualization tools.</p>
<a href="https://clickhouse.com/docs/" class="text-blue-600 hover:underline">Documentation</a>
</div>
<div class="data-flow bg-white p-4 rounded-lg shadow-md text-center">
<img src="https://upload.wikimedia.org/wikipedia/commons/4/40/DuckDB_logo.svg" alt="DuckDB" class="h-8 mx-auto mb-2">
<h4 class="font-semibold mb-2">DuckDB</h4>
<p class="text-sm text-gray-600">DuckDB is an in-process SQL OLAP database for analytics. It is optimized for single-node queries on local data files. DuckDB integrates with Python, R, and Pandas workflows. It supports Parquet, CSV, and other columnar formats.</p>
<a href="https://duckdb.org/docs/stable/" class="text-blue-600 hover:underline">Documentation</a>
</div>
<div class="data-flow bg-white p-4 rounded-lg shadow-md text-center">
<img src="https://www.apache.org/logos/res/doris/doris.png" alt="Apache Doris" class="h-8 mx-auto mb-2">
<h4 class="font-semibold mb-2">Apache Doris</h4>
<p class="text-sm text-gray-600">Apache Doris is a MPP (Massively Parallel Processing) analytical database. It provides high-performance real-time analytics for large datasets. Doris supports standard SQL and integrates with BI tools. It is used for dashboards, reporting, and ad-hoc queries.</p>
<a href="https://doris.apache.org/docs/3.0/gettingStarted/what-is-apache-doris/" class="text-blue-600 hover:underline">Documentation</a>
</div>
</div>
</div>
<!-- Processing - Batch -->
<div class="mb-12">
<h3 class="text-2xl font-semibold mb-6 flex items-center">
<i data-feather="cpu" class="mr-2"></i> Processing - Batch
</h3>
<div class="module-grid">
<div class="data-flow bg-white p-4 rounded-lg shadow-md text-center">
<img src="https://upload.wikimedia.org/wikipedia/commons/0/0e/Hadoop_logo.svg" alt="Hadoop MapReduce" class="h-8 mx-auto mb-2">
<h4 class="font-semibold mb-2">Hadoop MapReduce</h4>
<p class="text-sm text-gray-600">Hadoop MapReduce is the original batch processing framework for Hadoop. It splits large datasets into independent chunks processed in parallel across nodes. MapReduce handles fault tolerance and data distribution automatically. It is suited for large-scale, disk-based data processing workloads.</p>
<a href="https://hadoop.apache.org/docs/stable/hadoop-mapreduce-client/hadoop-mapreduce-client-core/MapReduceTutorial.html" class="text-blue-600 hover:underline">Documentation</a>
</div>
<div class="data-flow bg-white p-4 rounded-lg shadow-md text-center">
<img src="https://upload.wikimedia.org/wikipedia/commons/thumb/f/f3/Apache_Spark_logo.svg/1280px-Apache_Spark_logo.svg.png" alt="Spark Batch" class="h-8 mx-auto mb-2">
<h4 class="font-semibold mb-2">Spark Batch</h4>
<p class="text-sm text-gray-600">Apache Spark Batch provides in-memory distributed processing for big data. It accelerates batch analytics with faster processing than MapReduce. Spark integrates with Hadoop, Hive, HBase, and other data sources. It supports multiple languages like Python, Scala, and Java.</p>
<a href="https://spark.apache.org/docs/latest/" class="text-blue-600 hover:underline">Documentation</a>
</div>
<div class="data-flow bg-white p-4 rounded-lg shadow-md text-center">
<img src="https://upload.wikimedia.org/wikipedia/commons/thumb/6/68/Tezos_Logo_2022.png/250px-Tezos_Logo_2022.png" alt="Tez" class="h-8 mx-auto mb-2">
<h4 class="font-semibold mb-2">Tez</h4>
<p class="text-sm text-gray-600">Apache Tez is an optimized DAG-based execution engine for Hadoop workloads. It improves performance for Hive and Pig by replacing traditional MapReduce jobs. Tez reduces job latency and increases throughput. It allows complex data workflows to execute efficiently as directed acyclic graphs.</p>
<a href="https://tez.apache.org/" class="text-blue-600 hover:underline">Documentation</a>
</div>
</div>
</div>
<!-- Data Visualization & BI -->
<div class="mb-12">
<h3 class="text-2xl font-semibold mb-6 flex items-center">
<i data-feather="bar-chart-2" class="mr-2"></i> Data Visualization & BI
</h3>
<div class="module-grid">
<div class="data-flow bg-white p-4 rounded-lg shadow-md text-center">
<img src="https://tse2.mm.bing.net/th/id/OIP.ZT5zsMj1BW9yERM7N05LfgHaEK?cb=12&rs=1&pid=ImgDetMain&o=7&rm=3" alt="Power BI" class="h-8 mx-auto mb-2">
<h4 class="font-semibold mb-2">Power BI</h4>
<p class="text-sm text-gray-600">Business intelligence tool, dashboards</p>
</div>
<div class="data-flow bg-white p-4 rounded-lg shadow-md text-center">
<img src="https://tse4.mm.bing.net/th/id/OIP.O53hQPs0k17O3yjUkNHeEwHaFF?cb=12&rs=1&pid=ImgDetMain&o=7&rm=3" alt="Tableau" class="h-8 mx-auto mb-2">
<h4 class="font-semibold mb-2">Tableau</h4>
<p class="text-sm text-gray-600">Data visualization software, reporting</p>
</div>
<div class="data-flow bg-white p-4 rounded-lg shadow-md text-center">
<img src="https://tse3.mm.bing.net/th/id/OIP.yuN517T2Ged-3vBHMYG7jwAAAA?cb=12&rs=1&pid=ImgDetMain&o=7&rm=3" alt="MicroStrategy" class="h-8 mx-auto mb-2">
<h4 class="font-semibold mb-2">MicroStrategy</h4>
<p class="text-sm text-gray-600">Enterprise analytics</p>
</div>
<div class="data-flow bg-white p-4 rounded-lg shadow-md text-center">
<img src="https://tse2.mm.bing.net/th/id/OIP.q4xNo2DqPk52YlpYp6QwMAHaIP?cb=12&rs=1&pid=ImgDetMain&o=7&rm=3" alt="Plotly" class="h-8 mx-auto mb-2">
<h4 class="font-semibold mb-2">Plotly</h4>
<p class="text-sm text-gray-600">Interactive graphing library</p>
</div>
<div class="data-flow bg-white p-4 rounded-lg shadow-md text-center">
<img src="https://matplotlib.org/_static/logo_dark.svg" alt="Matplotlib" class="h-8 mx-auto mb-2">
<h4 class="font-semibold mb-2">Matplotlib</h4>
<p class="text-sm text-gray-600">Plotting library, Python visualizations</p>
</div>
<div class="data-flow bg-white p-4 rounded-lg shadow-md text-center">
<img src="https://ggplot2.tidyverse.org/logo.png" alt="ggplot2" class="h-8 mx-auto mb-2">
<h4 class="font-semibold mb-2">ggplot2</h4>
<p class="text-sm text-gray-600">Data visualization in R</p>
</div>
<div class="data-flow bg-white p-4 rounded-lg shadow-md text-center">
<img src="https://seaborn.pydata.org/_images/logo-tall-lightbg.svg" alt="Seaborn" class="h-8 mx-auto mb-2">
<h4 class="font-semibold mb-2">Seaborn</h4>
<p class="text-sm text-gray-600">Statistical data visualization</p>
</div>
<div class="data-flow bg-white p-4 rounded-lg shadow-md text-center">
<img src="https://pipedream.com/s.v0/app_mqehqz/logo/96" alt="Looker" class="h-8 mx-auto mb-2">
<h4 class="font-semibold mb-2">Looker</h4>
<p class="text-sm text-gray-600">Business intelligence and visualization</p>
</div>
<div class="data-flow bg-white p-4 rounded-lg shadow-md text-center">
<img src="https://tse4.mm.bing.net/th/id/OIP.0CTQu5RcpcAN5-HfQKl1mQHaHH?cb=12&rs=1&pid=ImgDetMain&o=7&rm=3" alt="Superset" class="h-8 mx-auto mb-2">
<h4 class="font-semibold mb-2">Superset</h4>
<p class="text-sm text-gray-600">Open-source BI tool</p>
</div>
<div class="data-flow bg-white p-4 rounded-lg shadow-md text-center">
<img src="https://tse1.mm.bing.net/th/id/OIP.qGiOz3fstOkWPLde7egi-wHaHa?cb=12&rs=1&pid=ImgDetMain&o=7&rm=3" alt="Dash" class="h-8 mx-auto mb-2">
<h4 class="font-semibold mb-2">Dash</h4>
<p class="text-sm text-gray-600">Python web app framework for dashboards</p>
</div>
<div class="data-flow bg-white p-4 rounded-lg shadow-md text-center">
<img src="https://th.bing.com/th/id/R.20caf36fcb6f6857cc540b569c47524b?rik=sWhAPRCKoUZBqQ&pid=ImgRaw&r=0" alt="Kibana" class="h-8 mx-auto mb-2">
<h4 class="font-semibold mb-2">Kibana</h4>
<p class="text-sm text-gray-600">Data visualization for Elasticsearch</p>
</div>
</div>
</div>
<!-- Processing - ETL / Scripting -->
<div class="mb-12">
<h3 class="text-2xl font-semibold mb-6 flex items-center">
<i data-feather="code" class="mr-2"></i> Processing - ETL / Scripting
</h3>
<div class="module-grid">
<div class="data-flow bg-white p-4 rounded-lg shadow-md text-center">
<img src="https://www.freeiconspng.com/uploads/pig-icon-5.png" alt="Pig" class="h-8 mx-auto mb-2">
<h4 class="font-semibold mb-2">Pig</h4>
<p class="text-sm text-gray-600">Apache Pig is a high-level scripting platform for processing large data sets in Hadoop. It provides a language called Pig Latin for expressing data transformations. Pig scripts are compiled into MapReduce or Tez jobs for execution. It is suitable for ETL tasks, data pipelines, and preprocessing.</p>
<a href="https://pig.apache.org/docs/r0.17.0/" class="text-blue-600 hover:underline">Documentation</a>
</div>
<div class="data-flow bg-white p-4 rounded-lg shadow-md text-center">
<img src="https://upload.wikimedia.org/wikipedia/commons/thumb/f/f3/Apache_Spark_logo.svg/2560px-Apache_Spark_logo.svg.png" alt="Spark ETL (PySpark / Scala)" class="h-8 mx-auto mb-2">
<h4 class="font-semibold mb-2">Spark ETL (PySpark / Scala)</h4>
<p class="text-sm text-gray-600">Apache Spark can be used for ETL workflows using PySpark or Scala APIs. It allows transformations and aggregations in memory for faster processing. Spark integrates with Hive, HDFS, Delta Lake, and other sources. It is widely used for batch and stream ETL pipelines.</p>
<a href="https://spark.apache.org/docs/latest/api/python/index.html" class="text-blue-600 hover:underline">Documentation</a>
</div>
<div class="data-flow bg-white p-4 rounded-lg shadow-md text-center">
<img src="https://upload.wikimedia.org/wikipedia/commons/thumb/9/97/Talend_logo.svg/1200px-Talend_logo.svg.png" alt="Talend" class="h-8 mx-auto mb-2">
<h4 class="font-semibold mb-2">Talend</h4>
<p class="text-sm text-gray-600">Talend is an enterprise ETL and data integration platform. It provides visual tools for designing and automating ETL workflows. Talend supports batch and real-time data integration across multiple sources. It integrates with Hadoop, Spark, cloud storage, and databases.</p>
<a href="https://help.qlik.com/talend/en-US/release-notes/8.0/documentation" class="text-blue-600 hover:underline">Documentation</a>
</div>
<div class="data-flow bg-white p-4 rounded-lg shadow-md text-center">
<img src="https://seekvectorlogo.com/wp-content/uploads/2019/11/informatica-vector-logo.png" alt="Informatica" class="h-8 mx-auto mb-2">
<h4 class="font-semibold mb-2">Informatica</h4>
<p class="text-sm text-gray-600">Informatica is a widely used data integration and ETL tool. It provides GUI-based workflows for batch and real-time processing. Informatica supports data quality, cleansing, and governance features. It integrates with cloud, Hadoop, and relational databases.</p>
<a href="https://docs.informatica.com/" class="text-blue-600 hover:underline">Documentation</a>
</div>
<div class="data-flow bg-white p-4 rounded-lg shadow-md text-center">
<img src="https://upload.wikimedia.org/wikipedia/commons/thumb/f/ff/Apache-nifi-logo.svg/1280px-Apache-nifi-logo.svg.png" alt="NiFi" class="h-8 mx-auto mb-2">
<h4 class="font-semibold mb-2">NiFi</h4>
<p class="text-sm text-gray-600">Apache NiFi is a data ingestion and ETL tool focused on flow-based programming. It supports drag-and-drop design of data flows and transformations. NiFi allows real-time and batch data movement between systems. It ensures reliable, scalable, and secure data routing.</p>
<a href="https://nifi.apache.org/docs/nifi-docs/" class="text-blue-600 hover:underline">Documentation</a>
</div>
<div class="data-flow bg-white p-4 rounded-lg shadow-md text-center">
<img src="https://upload.wikimedia.org/wikipedia/commons/thumb/e/ec/Alteryx_logo.svg/2560px-Alteryx_logo.svg.png" alt="Alteryx" class="h-8 mx-auto mb-2">
<h4 class="font-semibold mb-2">Alteryx</h4>
<p class="text-sm text-gray-600">Alteryx is a self-service data analytics and ETL platform. It provides workflow automation for data preparation, blending, and analysis. Alteryx supports integration with databases, applications, and cloud services. It is designed for both technical and business users.</p>
<a href="https://help.alteryx.com/" class="text-blue-600 hover:underline">Documentation</a>
</div>
</div>
</div>
<!-- Processing - DAG / Orchestration -->
<div class="mb-12">
<h3 class="text-2xl font-semibold mb-6 flex items-center">
<i data-feather="git-branch" class="mr-2"></i> Processing - DAG / Orchestration
</h3>
<div class="module-grid">
<div class="data-flow bg-white p-4 rounded-lg shadow-md text-center">
<img src="https://upload.wikimedia.org/wikipedia/commons/thumb/0/09/Apache_Oozie_logo.svg/1280px-Apache_Oozie_logo.svg.png" alt="Oozie" class="h-8 mx-auto mb-2">
<h4 class="font-semibold mb-2">Oozie</h4>
<p class="text-sm text-gray-600">Apache Oozie is a workflow scheduler for Hadoop jobs. It coordinates MapReduce, Spark, Hive, and Pig tasks in directed acyclic graphs (DAGs). Oozie supports time and data triggers for workflow execution. It integrates with Hadoop ecosystem for batch processing pipelines.</p>
<a href="https://oozie.apache.org/docs/4.2.0/" class="text-blue-600 hover:underline">Documentation</a>
</div>
<div class="data-flow bg-white p-4 rounded-lg shadow-md text-center">
<img src="https://upload.wikimedia.org/wikipedia/commons/thumb/d/de/AirflowLogo.png/1200px-AirflowLogo.png" alt="Airflow" class="h-8 mx-auto mb-2">
<h4 class="font-semibold mb-2">Airflow</h4>
<p class="text-sm text-gray-600">Apache Airflow is a workflow orchestration platform for scheduling and monitoring tasks. It allows defining DAGs using Python code for flexibility. Airflow supports execution of ETL, analytics, and machine learning pipelines. It provides a web UI for monitoring, logging, and troubleshooting workflows.</p>
<a href="https://airflow.apache.org/docs/" class="text-blue-600 hover:underline">Documentation</a>
</div>
<div class="data-flow bg-white p-4 rounded-lg shadow-md text-center">
<img src="https://cdn.worldvectorlogo.com/logos/prefect-1.svg" alt="Prefect" class="h-8 mx-auto mb-2">
<h4 class="font-semibold mb-2">Prefect</h4>
<p class="text-sm text-gray-600">Prefect is a modern workflow orchestration tool for ETL and data pipelines. It supports Python-based DAGs with easy scheduling and retry mechanisms. Prefect emphasizes observability, error handling, and reliability. It integrates with cloud platforms, data warehouses, and APIs.</p>
<a href="https://docs.prefect.io/" class="text-blue-600 hover:underline">Documentation</a>
</div>
<div class="data-flow bg-white p-4 rounded-lg shadow-md text-center">
<img src="https://upload.wikimedia.org/wikipedia/commons/thumb/7/75/Luigi_emblem.svg/1200px-Luigi_emblem.svg.png" alt="Luigi" class="h-8 mx-auto mb-2">
<h4 class="font-semibold mb-2">Luigi</h4>
<p class="text-sm text-gray-600">Luigi is a Python-based workflow orchestration tool for batch jobs. It allows defining dependencies and execution order for tasks. Luigi handles workflow visualization, retries, and logging. It integrates with Hadoop, Spark, and local processes.</p>
<a href="https://luigi.readthedocs.io/" class="text-blue-600 hover:underline">Documentation</a>
</div>
<div class="data-flow bg-white p-4 rounded-lg shadow-md text-center">
<img src="https://dagster-website.vercel.app/images/brand/logos/dagster-primary-horizontal.jpg" alt="Dagster" class="h-8 mx-auto mb-2">
<h4 class="font-semibold mb-2">Dagster</h4>
<p class="text-sm text-gray-600">Dagster is an orchestration platform for ETL, analytics, and machine learning pipelines. It provides type-aware DAGs for reliable pipeline development. Dagster supports observability, scheduling, and cloud deployment. It integrates with Spark, dbt, Airflow, and other tools.</p>
<a href="https://docs.dagster.io/" class="text-blue-600 hover:underline">Documentation</a>
</div>
</div>
</div>
<!-- Processing - Unified / Streaming -->
<div class="mb-12">
<h3 class="text-2xl font-semibold mb-6 flex items-center">
<i data-feather="activity" class="mr-2"></i> Processing - Unified / Streaming
</h3>
<div class="module-grid">
<div class="data-flow bg-white p-4 rounded-lg shadow-md text-center">
<img src="https://upload.wikimedia.org/wikipedia/commons/f/f3/Apache_Spark_logo.svg" alt="Spark (Batch+Streaming)" class="h-8 mx-auto mb-2">
<h4 class="font-semibold mb-2">Spark (Batch+Streaming)</h4>
<p class="text-sm text-gray-600">Apache Spark provides unified APIs for batch and streaming data processing. It supports in-memory computation for faster analytics. Spark integrates with Hadoop, Hive, Kafka, and Delta Lake. It is used for ETL, analytics, and machine learning pipelines.</p>
<a href="https://spark.apache.org/docs/latest/streaming-programming-guide.html" class="text-blue-600 hover:underline">Documentation</a>
</div>
<div class="data-flow bg-white p-4 rounded-lg shadow-md text-center">
<img src="https://flink.apache.org/img/logo/png/1000/flink_squirrel_1000.png" alt="Flink" class="h-8 mx-auto mb-2">
<h4 class="font-semibold mb-2">Flink</h4>
<p class="text-sm text-gray-600">Apache Flink is a stream processing framework for real-time analytics. It supports event-time processing, windowing, and fault tolerance. Flink integrates with Kafka, HDFS, and various storage systems. It is used for real-time ETL, analytics, and alerting applications.</p>
<a href="https://nightlies.apache.org/flink/flink-docs-lts/" class="text-blue-600 hover:underline">Documentation</a>
</div>
<div class="data-flow bg-white p-4 rounded-lg shadow-md text-center">
<img src="https://cdn.freebiesupply.com/logos/large/2x/jim-beam-logo-png-transparent.png" alt="Beam" class="h-8 mx-auto mb-2">
<h4 class="font-semibold mb-2">Beam</h4>
<p class="text-sm text-gray-600">Apache Beam is a unified programming model for batch and streaming data pipelines. It allows running pipelines on multiple execution engines like Spark, Flink, and Dataflow. Beam supports windowing, triggers, and advanced event-time processing. It simplifies building portable and scalable data pipelines.</p>
<a href="https://beam.apache.org/documentation/" class="text-blue-600 hover:underline">Documentation</a>
</div>
<div class="data-flow bg-white p-4 rounded-lg shadow-md text-center">
<img src="https://svn.apache.org/repos/asf/kafka/site/logos/originals/png/WIDE%20-%20Black%20on%20Transparent.png" alt="Kafka Streams" class="h-8 mx-auto mb-2">
<h4 class="font-semibold mb-2">Kafka Streams</h4>
<p class="text-sm text-gray-600">Kafka Streams is a lightweight library for building real-time stream processing applications. It processes data directly from Kafka topics. Kafka Streams supports transformations, joins, and aggregations. It is suitable for microservices and event-driven architectures.</p>
<a href="https://kafka.apache.org/documentation/streams/" class="text-blue-600 hover:underline">Documentation</a>
</div>
<div class="data-flow bg-white p-4 rounded-lg shadow-md text-center">
<img src="https://www.vhv.rs/dpng/d/615-6150272_google-dataflow-logo-hd-png-download.png" alt="Dataflow" class="h-8 mx-auto mb-2">
<h4 class="font-semibold mb-2">Dataflow</h4>
<p class="text-sm text-gray-600">Google Cloud Dataflow is a fully managed stream and batch processing service. It executes Apache Beam pipelines in a serverless environment. Dataflow supports real-time analytics, ETL, and machine learning preprocessing. It automatically scales resources and handles fault tolerance.</p>
<a href="https://cloud.google.com/dataflow/docs" class="text-blue-600 hover:underline">Documentation</a>
</div>
<div class="data-flow bg-white p-4 rounded-lg shadow-md text-center">
<img src="https://assets.streamlinehq.com/image/private/w_240,h_240,ar_1/f_auto/v1/icons/amazon-web-services/amazon-kinesis-data-analytics-pyt8fban9gofi16ug59pl8.png/amazon-kinesis-data-analytics-e45x4z6djg600zs9ef9stkj.png?_a=DATAg1AAZAA0" alt="Kinesis Data Analytics" class="h-8 mx-auto mb-2">
<h4 class="font-semibold mb-2">Kinesis Data Analytics</h4>
<p class="text-sm text-gray-600">Amazon Kinesis Data Analytics processes streaming data in real time. It allows building SQL-based or application-based streaming analytics. Kinesis integrates with Kinesis Streams, Firehose, and other AWS services. It is used for real-time monitoring, dashboards, and alerting systems.</p>
<a href="https://docs.aws.amazon.com/kinesisanalytics/latest/sqlref/analytics-sql-reference-dg.html" class="text-blue-600 hover:underline">Documentation</a>
</div>
</div>
</div>
<!-- Data Ingestion -->
<div class="mb-12">
<h3 class="text-2xl font-semibold mb-6 flex items-center">
<i data-feather="download" class="mr-2"></i> Data Ingestion
</h3>
<div class="module-grid">
<div class="data-flow bg-white p-4 rounded-lg shadow-md text-center">
<img src="https://mma.prnewswire.com/media/2449607/Flume_Logo.jpg?p=facebook" alt="Flume" class="h-8 mx-auto mb-2">
<h4 class="font-semibold mb-2">Flume</h4>
<p class="text-sm text-gray-600">Apache Flume is a service for efficiently collecting, aggregating, and moving large amounts of log data. It is optimized for streaming logs into HDFS or other storage systems. Flume supports fault tolerance and reliability. It is used for event-driven and log ingestion pipelines.</p>
<a href="https://flume.apache.org/documentation.html" class="text-blue-600 hover:underline">Documentation</a>
</div>
<div class="data-flow bg-white p-4 rounded-lg shadow-md text-center">
<img src="https://assets.stickpng.com/thumbs/584809c9cef1014c0b5e4909.png" alt="Kafka" class="h-8 mx-auto mb-2">
<h4 class="font-semibold mb-2">Kafka</h4>
<p class="text-sm text-gray-600">Apache Kafka is a distributed streaming platform for building real-time pipelines and applications. It handles publishing, subscribing, storing, and processing event streams. Kafka ensures durability and fault tolerance of messages. It integrates with Spark, Flink, Hadoop, and other analytics tools.</p>
<a href="https://kafka.apache.org/documentation/" class="text-blue-600 hover:underline">Documentation</a>
</div>
<div class="data-flow bg-white p-4 rounded-lg shadow-md text-center">
<img src="https://www.vhv.rs/dpng/d/475-4754599_pulsar-logo-vector-pulsar-sticker-hd-png-download.png" alt="Pulsar" class="h-8 mx-auto mb-2">
<h4 class="font-semibold mb-2">Pulsar</h4>
<p class="text-sm text-gray-600">Apache Pulsar is a distributed messaging and streaming platform. It provides multi-tenant, high-throughput, and low-latency messaging. Pulsar supports persistent messaging and real-time data streams. It integrates with Spark, Flink, and other analytics frameworks.</p>
<a href="https://pulsar.apache.org/docs/next/" class="text-blue-600 hover:underline">Documentation</a>
</div>
<div class="data-flow bg-white p-4 rounded-lg shadow-md text-center">
<img src="https://cdn.freebiesupply.com/logos/large/2x/elastic-logstash-logo-svg-vector.svg" alt="Logstash" class="h-8 mx-auto mb-2">
<h4 class="font-semibold mb-2">Logstash</h4>
<p class="text-sm text-gray-600">Logstash is a server-side data processing pipeline that ingests, transforms, and forwards data. It supports a variety of input sources, filters, and outputs. Logstash is widely used with Elasticsearch and Kibana for analytics. It enables centralized logging and real-time data processing.</p>
<a href="https://www.elastic.co/docs/reference/logstash" class="text-blue-600 hover:underline">Documentation</a>
</div>
<div class="data-flow bg-white p-4 rounded-lg shadow-md text-center">
<img src="https://docs.fluentd.org/~gitbook/image?url=https%3A%2F%2F1982584918-files.gitbook.io%2F%7E%2Ffiles%2Fv0%2Fb%2Fgitbook-legacy-files%2Fo%2Fassets%252F-LR7OsqPORtP86IQxs6E%252F-LkRmVyw0vEoFO0R2Z5g%252F-LkRmp4PSNd4S9mwEdlI%252FFluentd_square.png%3Fgeneration%3D1563851185335977%26alt%3Dmedia&width=300&dpr=4&quality=100&sign=d940457c&sv=2" alt="Fluentd" class="h-8 mx-auto mb-2">
<h4 class="font-semibold mb-2">Fluentd</h4>
<p class="text-sm text-gray-600">Fluentd is an open-source data collector for unified logging layers. It supports over 500 plugins for input, output, and transformation. Fluentd ensures reliable, scalable, and structured data flows. It integrates with Elasticsearch, Kafka, and cloud storage.</p>
<a href="https://docs.fluentd.org/" class="text-blue-600 hover:underline">Documentation</a>
</div>
<div class="data-flow bg-white p-4 rounded-lg shadow-md text-center">
<img src="https://www.citypng.com/public/uploads/preview/black-official-amazon-logo-701751694791962sskadamgig.png" alt="Vector" class="h-8 mx-auto mb-2">
<h4 class="font-semibold mb-2">Vector</h4>
<p class="text-sm text-gray-600">Vector is a high-performance observability data pipeline for logs and metrics. It collects, transforms, and routes data efficiently in real time. Vector supports integrations with multiple destinations like Elasticsearch and S3. It is designed for minimal resource usage and high throughput.</p>
<a href="https://vector.dev/docs/" class="text-blue-600 hover:underline">Documentation</a>
</div>
<div class="data-flow bg-white p-4 rounded-lg shadow-md text-center">
<img src="https://assets.streamlinehq.com/image/private/w_240,h_240,ar_1/f_auto/v1/icons/amazon-web-services/amazon-kinesis-data-stream-wz8nl3bus1i8rpjk0tmsg.png/amazon-kinesis-data-stream-l2os71f1164xr62lujpck.png?_a=DATAg1AAZAA0" alt="Kinesis Streams" class="h-8 mx-auto mb-2">
<h4 class="font-semibold mb-2">Kinesis Streams</h4>
<p class="text-sm text-gray-600">Amazon Kinesis Streams enables real-time ingestion of large-scale streaming data. It supports data retention, sharding, and parallel processing. Kinesis Streams integrates with analytics and storage services. It is used for real-time dashboards, monitoring, and ETL.</p>
<a href="https://docs.aws.amazon.com/streams/latest/dev/introduction.html" class="text-blue-600 hover:underline">Documentation</a>
</div>
<div class="data-flow bg-white p-4 rounded-lg shadow-md text-center">
<img src="https://upload.wikimedia.org/wikipedia/commons/thumb/b/b4/Apache_Sqoop_logo.svg/1280px-Apache_Sqoop_logo.svg.png" alt="Sqoop" class="h-8 mx-auto mb-2">
<h4 class="font-semibold mb-2">Sqoop</h4>
<p class="text-sm text-gray-600">Apache Sqoop transfers bulk data between relational databases and Hadoop. It supports import/export of tables and incremental data loads. Sqoop integrates with Hive, HDFS, and HBase for analytics. It is widely used for ETL pipelines from RDBMS to Hadoop.</p>
<a href="https://sqoop.apache.org/docs/1.99.7/index.html" class="text-blue-600 hover:underline">Documentation</a>
</div>
<div class="data-flow bg-white p-4 rounded-lg shadow-md text-center">
<img src="https://assets.stickpng.com/images/62b316e1b223544c209f5e77.png" alt="Fivetran" class="h-8 mx-auto mb-2">
<h4 class="font-semibold mb-2">Fivetran</h4>
<p class="text-sm text-gray-600">Fivetran is a managed data pipeline service for extracting and loading data. It automates connectors to databases, applications, and cloud services. Fivetran handles schema changes, updates, and incremental loads. It integrates with warehouses like Snowflake, BigQuery, and Redshift.</p>
<a href="https://fivetran.com/docs/getting-started" class="text-blue-600 hover:underline">Documentation</a>
</div>
<div class="data-flow bg-white p-4 rounded-lg shadow-md text-center">
<img src="https://toppng.com/uploads/preview/stitch-lilo-and-stitch-11549873288mq3tllff86.png" alt="Stitch" class="h-8 mx-auto mb-2">
<h4 class="font-semibold mb-2">Stitch</h4>
<p class="text-sm text-gray-600">Stitch is a cloud ETL service for extracting, transforming, and loading data. It provides pre-built connectors for databases, SaaS apps, and cloud storage. Stitch handles incremental data loads and error logging. It integrates with modern data warehouses for analytics.</p>
<a href="https://www.stitchdata.com/docs/" class="text-blue-600 hover:underline">Documentation</a>
</div>
<div class="data-flow bg-white p-4 rounded-lg shadow-md text-center">
<img src="https://vectorseek.com/wp-content/uploads/2023/04/Debezium-Logo-Vector.jpg" alt="Debezium" class="h-8 mx-auto mb-2">
<h4 class="font-semibold mb-2">Debezium</h4>
<p class="text-sm text-gray-600">Debezium is a change data capture platform for databases. It streams database changes in real-time to Kafka or other streaming platforms. Debezium supports MySQL, PostgreSQL, MongoDB, SQL Server, and more. It enables event-driven architectures and data replication.</p>
<a href="https://debezium.io/documentation/reference/stable/index.html" class="text-blue-600 hover:underline">Documentation</a>
</div>
</div>
</div>
<!-- Workflow / Coordination / Governance -->
<div class="mb-12">
<h3 class="text-2xl font-semibold mb-6 flex items-center">
<i data-feather="shield" class="mr-2"></i> Workflow / Coordination / Governance
</h3>
<div class="module-grid">
<div class="data-flow bg-white p-4 rounded-lg shadow-md text-center">
<img src="https://upload.wikimedia.org/wikipedia/commons/7/77/Apache_ZooKeeper_logo.svg" alt="ZooKeeper" class="h-8 mx-auto mb-2">
<h4 class="font-semibold mb-2">ZooKeeper</h4>
<p class="text-sm text-gray-600">Apache ZooKeeper is a centralized service for maintaining configuration information and naming. It provides distributed synchronization and coordination for clusters. ZooKeeper ensures reliability and fault tolerance for distributed applications. It is widely used in Hadoop, Kafka, and HBase ecosystems.</p>
<a href="https://zookeeper.apache.org/documentation.html" class="text-blue-600 hover:underline">Documentation</a>
</div>
<div class="data-flow bg-white p-4 rounded-lg shadow-md text-center">
<img src="https://assets.stickpng.com/images/62b21f49038aad4d3ed7ca1f.png" alt="Consul" class="h-8 mx-auto mb-2">
<h4 class="font-semibold mb-2">Consul</h4>
<p class="text-sm text-gray-600">Consul is a service mesh and discovery tool for managing distributed systems. It provides service registration, health checking, and key/value storage. Consul ensures secure service-to-service communication in clusters. It integrates with Kubernetes, cloud services, and microservices frameworks.</p>
<a href="https://developer.hashicorp.com/consul/docs" class="text-blue-600 hover:underline">Documentation</a>
</div>
<div class="data-flow bg-white p-4 rounded-lg shadow-md text-center">
<img src="https://upload.wikimedia.org/wikipedia/labs/5/53/Etcd_logo.png" alt="etcd" class="h-8 mx-auto mb-2">
<h4 class="font-semibold mb-2">etcd</h4>
<p class="text-sm text-gray-600">etcd is a distributed key-value store for configuration and service discovery. It provides strong consistency and reliability for clustered environments. etcd is commonly used with Kubernetes and other orchestration platforms. It supports leader election and distributed coordination.</p>
<a href="https://etcd.io/docs/" class="text-blue-600 hover:underline">Documentation</a>
</div>
<div class="data-flow bg-white p-4 rounded-lg shadow-md text-center">
<img src="https://www.pngfind.com/pngs/m/345-3454614_atlas-logo-png-transparent-club-atlas-png-download.png" alt="Atlas" class="h-8 mx-auto mb-2">
<h4 class="font-semibold mb-2">Atlas</h4>
<p class="text-sm text-gray-600">Apache Atlas provides metadata management and governance for Hadoop. It tracks data lineage, classification, and access policies. Atlas integrates with Hive, HBase, and other Hadoop ecosystem tools. It ensures compliance with data governance requirements.</p>
<a href="https://atlas.apache.org/" class="text-blue-600 hover:underline">Documentation</a>
</div>
<div class="data-flow bg-white p-4 rounded-lg shadow-md text-center">
<img src="https://upload.wikimedia.org/wikipedia/commons/d/d3/Texas_Rangers_logo.png" alt="Ranger" class="h-8 mx-auto mb-2">
<h4 class="font-semibold mb-2">Ranger</h4>
<p class="text-sm text-gray-600">Apache Ranger provides centralized security administration for Hadoop. It manages access control policies for files, databases, and applications. Ranger supports auditing, monitoring, and fine-grained permissions. It integrates with Hive, HDFS, HBase, and Kafka for governance.</p>
<a href="https://ranger.apache.org/" class="text-blue-600 hover:underline">Documentation</a>
</div>
<div class="data-flow bg-white p-4 rounded-lg shadow-md text-center">
<img src="https://www.awsicon.com/static/images/Service-Icons/Analytics/64/png5x/AWS-Glue-Data-Catalog.png" alt="Glue Data Catalog" class="h-8 mx-auto mb-2">
<h4 class="font-semibold mb-2">Glue Data Catalog</h4>
<p class="text-sm text-gray-600">AWS Glue Data Catalog is a central metadata repository for ETL and analytics. It stores table definitions, schemas, and data location information. Glue integrates with Spark, Redshift, Athena, and other AWS services. It enables data discovery, governance, and automated ETL workflows.</p>
<a href="https://docs.aws.amazon.com/glue/" class="text-blue-600 hover:underline">Documentation</a>
</div>
<div class="data-flow bg-white p-4 rounded-lg shadow-md text-center">
<img src="https://upload.wikimedia.org/wikipedia/commons/5/58/Collibra-Logo-RGB-FullColor.png" alt="Collibra" class="h-8 mx-auto mb-2">
<h4 class="font-semibold mb-2">Collibra</h4>
<p class="text-sm text-gray-600">Collibra is an enterprise data governance and catalog platform. It manages data policies, lineage, and stewardship workflows. Collibra integrates with cloud storage, data warehouses, and BI tools. It helps organizations ensure compliance and data quality.</p>
<a href="https://productresources.collibra.com/docs/collibra/latest/Content/Home.htm" class="text-blue-600 hover:underline">Documentation</a>
</div>
<div class="data-flow bg-white p-4 rounded-lg shadow-md text-center">
<img src="https://upload.wikimedia.org/wikipedia/en/e/ef/Alation%2C_Inc._logo.png" alt="Alation" class="h-8 mx-auto mb-2">
<h4 class="font-semibold mb-2">Alation</h4>
<p class="text-sm text-gray-600">Alation provides a data catalog platform for discovery, governance, and collaboration. It helps users find, understand, and trust their data assets. Alation integrates with databases, warehouses, and analytics tools. It provides lineage, access management, and compliance features.</p>
<a href="https://www.alation.com/docs/en/latest/" class="text-blue-600 hover:underline">Documentation</a>
</div>
</div>
</div>
<!-- Monitoring / Observability -->
<div class="mb-12">
<h3 class="text-2xl font-semibold mb-6 flex items-center">
<i data-feather="eye" class="mr-2"></i> Monitoring / Observability
</h3>
<div class="module-grid">
<div class="data-flow bg-white p-4 rounded-lg shadow-md text-center">
<img src="https://upload.wikimedia.org/wikipedia/commons/2/29/Cloudera_logo_darkorange.png" alt="Cloudera Manager" class="h-8 mx-auto mb-2">
<h4 class="font-semibold mb-2">Cloudera Manager</h4>
<p class="text-sm text-gray-600">Cloudera Manager provides enterprise-grade Hadoop cluster management. It offers monitoring, configuration, and operational automation. Cloudera Manager supports alerting, metrics, and job tracking. It integrates with Cloudera distribution services and analytics platforms.</p>
<a href="https://docs.cloudera.com/" class="text-blue-600 hover:underline">Documentation</a>
</div>
<div class="data-flow bg-white p-4 rounded-lg shadow-md text-center">
<img src="https://upload.wikimedia.org/wikipedia/commons/thumb/1/1b/Basal_Ganglia_and_Related_Structures.svg/1200px-Basal_Ganglia_and_Related_Structures.svg.png" alt="Ganglia" class="h-8 mx-auto mb-2">
<h4 class="font-semibold mb-2">Ganglia</h4>
<p class="text-sm text-gray-600">Ganglia is a scalable, distributed monitoring system for high-performance computing. It visualizes metrics for clusters, nodes, and applications. Ganglia supports hierarchical aggregation of monitoring data. It is widely used in HPC and large-scale cluster environments.</p>
<a href="https://developer.nvidia.com/ganglia-monitoring-system" class="text-blue-600 hover:underline">Documentation</a>
</div>
<div class="data-flow bg-white p-4 rounded-lg shadow-md text-center">
<img src="https://upload.wikimedia.org/wikipedia/commons/thumb/3/38/Prometheus_software_logo.svg/2066px-Prometheus_software_logo.svg.png" alt="Prometheus" class="h-8 mx-auto mb-2">
<h4 class="font-semibold mb-2">Prometheus</h4>
<p class="text-sm text-gray-600">Prometheus is an open-source metrics collection and alerting system. Grafana provides visualization dashboards for Prometheus metrics. Together they enable monitoring of clusters, applications, and pipelines. They support alerting, historical metrics, and data exploration.</p>
<a href="https://prometheus.io/docs/introduction/overview/" class="text-blue-600 hover:underline">Documentation</a>
</div>
<div class="data-flow bg-white p-4 rounded-lg shadow-md text-center">
<img src="https://upload.wikimedia.org/wikipedia/commons/thumb/a/a1/Grafana_logo.svg/2005px-Grafana_logo.svg.png" alt="Grafana" class="h-8 mx-auto mb-2">
<h4 class="font-semibold mb-2">Grafana</h4>
<p class="text-sm text-gray-600">Grafana is an open-source platform for monitoring and observability. It provides dashboards, visualizations, and alerting for metrics and logs. Grafana integrates with Prometheus, Elasticsearch, and cloud services. It is used for cluster and application performance monitoring.</p>
<a href="https://grafana.com/docs/" class="text-blue-600 hover:underline">Documentation</a>
</div>
<div class="data-flow bg-white p-4 rounded-lg shadow-md text-center">
<img src="https://imgix.datadoghq.com//img/about/presskit/DDlogo.jpg?dpr=2&auto=format" alt="DataDog" class="h-8 mx-auto mb-2">
<h4 class="font-semibold mb-2">DataDog</h4>
<p class="text-sm text-gray-600">Datadog is a monitoring and observability platform for cloud infrastructure. It provides metrics, logs, traces, and dashboards in real time. Datadog integrates with cloud providers, databases, and containers. It enables alerting, anomaly detection, and analytics for operations.</p>
<a href="https://docs.datadoghq.com/" class="text-blue-600 hover:underline">Documentation</a>
</div>
<div class="data-flow bg-white p-4 rounded-lg shadow-md text-center">
<img src="https://newrelic.com/themes/custom/erno/assets/mediakit/new_relic_logo_vertical.png" alt="New Relic" class="h-8 mx-auto mb-2">
<h4 class="font-semibold mb-2">New Relic</h4>
<p class="text-sm text-gray-600">New Relic is an observability platform for monitoring applications and infrastructure. It provides real-time metrics, logs, traces, and performance analysis. New Relic supports dashboards, alerts, and anomaly detection. It integrates with cloud services, containers, and databases.</p>
<a href="https://docs.newrelic.com/" class="text-blue-600 hover:underline">Documentation</a>
</div>
<div class="data-flow bg-white p-4 rounded-lg shadow-md text-center">
<img src="https://upload.wikimedia.org/wikipedia/commons/thumb/f/f4/Elasticsearch_logo.svg/1024px-Elasticsearch_logo.svg.png" alt="Elasticsearch" class="h-8 mx-auto mb-2">
<h4 class="font-semibold mb-2">Elasticsearch</h4>
<p class="text-sm text-gray-600">Elasticsearch is a distributed search and analytics engine for structured and unstructured data. It enables real-time search, analytics, and logging. Elasticsearch integrates with Logstash, Kibana, and Beats. It is widely used for monitoring, log aggregation, and search applications.</p>
<a href="https://www.elastic.co/docs" class="text-blue-600 hover:underline">Documentation</a>
</div>
<div class="data-flow bg-white p-4 rounded-lg shadow-md text-center">
<img src="https://solr.apache.org/theme/images/identity/Solr_Logo_on_white.png" alt="Solr" class="h-8 mx-auto mb-2">
<h4 class="font-semibold mb-2">Solr</h4>
<p class="text-sm text-gray-600">Apache Solr is an open-source search platform built on Lucene. It supports full-text search, faceted search, and analytics. Solr scales horizontally for distributed search and indexing. It integrates with logging, monitoring, and data pipelines.</p>
<a href="https://solr.apache.org/guide/solr/latest/index.html" class="text-blue-600 hover:underline">Documentation</a>
</div>
<div class="data-flow bg-white p-4 rounded-lg shadow-md text-center">
<img src="https://upload.wikimedia.org/wikipedia/commons/f/f8/Splunk_logo.png" alt="Splunk" class="h-8 mx-auto mb-2">
<h4 class="font-semibold mb-2">Splunk</h4>
<p class="text-sm text-gray-600">Splunk is a platform for searching, monitoring, and analyzing machine-generated data. It indexes logs and events in real time for analytics and alerting. Splunk provides dashboards, reports, and anomaly detection. It integrates with cloud, IT, and security data sources.</p>
<a href="https://docs.splunk.com/Documentation" class="text-blue-600 hover:underline">Documentation</a>
</div>
<div class="data-flow bg-white p-4 rounded-lg shadow-md text-center">
<img src="https://upload.wikimedia.org/wikipedia/commons/thumb/b/b2/Graylog_logo.svg/2560px-Graylog_logo.svg.png" alt="Graylog" class="h-8 mx-auto mb-2">
<h4 class="font-semibold mb-2">Graylog</h4>
<p class="text-sm text-gray-600">Graylog is a centralized log management platform. It allows real-time search, analysis, and visualization of log data. Graylog supports alerting, dashboards, and structured logging. It integrates with multiple data stores.</p>
<a href="https://go2docs.graylog.org/" class="text-blue-600 hover:underline">Documentation</a>
</div>
</div>
</div>
<!-- Machine Learning / Analytics -->
<div class="mb-12">
<h3 class="text-2xl font-semibold mb-6 flex items-center">
<i data-feather="cpu" class="mr-2"></i> Machine Learning / Analytics
</h3>
<div class="module-grid">
<div class="data-flow bg-white p-4 rounded-lg shadow-md text-center">
<img src="https://icon2.cleanpng.com/20180611/vwx/kisspng-apache-spark-apache-http-server-repository-stream-5b1e5ef4b13237.1025973815287170447258.jpg" alt="Spark MLlib" class="h-8 mx-auto mb-2">
<h4 class="font-semibold mb-2">Spark MLlib</h4>
<p class="text-sm text-gray-600">Apache Spark MLlib is a scalable machine learning library for Spark. It provides algorithms for classification, regression, clustering, and recommendation. MLlib integrates with Spark’s data processing engine for large-scale analytics. It supports Python, Scala, Java, and R APIs for ML pipelines.</p>
<a href="https://spark.apache.org/docs/latest/ml-guide.html" class="text-blue-600 hover:underline">Documentation</a>
</div>
<div class="data-flow bg-white p-4 rounded-lg shadow-md text-center">
<img src="https://upload.wikimedia.org/wikipedia/commons/thumb/7/7c/Apache_Ambari_Logo.svg/1200px-Apache_Ambari_Logo.svg.png" alt="Ambari (retired)" class="h-8 mx-auto mb-2">