Big data infrastructure internship | Adaltas

Position description

Large Knowledge and dispersed computing are at the core of Adaltas. We accompagny our partners in the deployment, routine maintenance, and optimization of some of the biggest clusters in France. Considering that not too long ago we also present help for day-working day functions.

As a fantastic defender and active contributor of open resource, we are at the forefront of the facts platform initiative TDP (TOSIT Facts Platform).

Throughout this internship, you will contribute to the improvement of TDP, its industrialization, and the integration of new open up source parts and new functionalities. You will be accompanied by the Alliage skilled crew in charge of TDP editor aid.

You will also work with the Kubernetes ecosystem and the automation of datalab deployments Onyxia, which we want to make readily available to our consumers as properly as to learners as element of our teaching modules (devops, significant knowledge, and so on.).

Your skills will help to develop the services of Alliage’s open supply guidance giving. Supported open resource parts include TDP, Onyxia, ScyllaDB, … For individuals who would like to do some world wide web get the job done in addition to big facts, we now have a pretty practical intranet (ticket management, time administration, advanced research, mentions and connected articles, …) but other awesome capabilities are predicted.

You will follow GitOps launch chains and create articles or blog posts.

You will perform in a group with senior advisors as mentor.

Enterprise presentation

Adaltas is a consulting company led by a group of open up supply professionals concentrating on information management. We deploy and function the storage and computing infrastructures in collaboration with our shoppers.

Partner with Cloudera and Databricks, we are also open up resource contributors. We invite you to look through our web-site and our several technical publications to learn a lot more about the corporation.

Techniques essential and to be obtained

Automating the deployment of the Onyxia datalab necessitates know-how of Kubernetes and Cloud native. You must be cozy with the Kubernetes ecosystem, the Hadoop ecosystem, and the distributed computing product. You will grasp how the simple parts (HDFS, YARN, object storage, Kerberos, OAuth, etc.) work with each other to fulfill the works by using of big info.

A good knowledge of working with Linux and the command line is needed.

Throughout the internship, you will master:

  • The Kubernetes/Hadoop ecosystem in order to contribute to the TDP job
  • Securing clusters with Kerberos and SSL/TLS certificates
  • Significant availability (HA) of companies
  • The distribution of methods and workloads
  • Supervision of products and services and hosted apps
  • Fault tolerant Hadoop cluster with recoverability of missing facts on infrastructure failure
  • Infrastructure as Code (IaC) by means of DevOps tools these kinds of as Ansible and [Vagrant](/en/tag/hashicorp- vagrant/)
  • Be relaxed with the architecture and operation of a info lakehouse
  • Code collaboration with Git, Gitlab and Github


  • Turn into common with the architecture and configuration approaches of the TDP distribution
  • Deploy and test protected and extremely offered TDP clusters
  • Contribute to the TDP knowledge foundation with troubleshooting guides, FAQs and articles or blog posts
  • Actively add thoughts and code to make iterative advancements to the TDP ecosystem
  • Investigate and review the distinctions amongst the most important Hadoop distributions
  • Update Adaltas Cloud making use of Nikita
  • Add to the progress of a software to accumulate purchaser logs and metrics on TDP and ScyllaDB
  • Actively contribute ideas to establish our guidance remedy

Further info

  • Locale: Boulogne Billancourt, France
  • Languages: French or English
  • Commencing date: March 2023
  • Duration: 6 months

Considerably of the digital world runs on Open up Source software package and the Massive Details industry is booming. This internship is an possibility to get valuable experience in the two domains. TDP is now the only truly Open up Supply Hadoop distribution. This is a great momentum. As portion of the TDP workforce, you will have the risk to find out one of the core major details processing designs and participate in the advancement and the foreseeable future roadmap of TDP. We feel that this is an fascinating option and that on completion of the internship, you will be all set for a successful job in Large Knowledge.

Gear readily available

A laptop with the adhering to traits:

  • 32GB RAM
  • 1TB SSD
  • 8c/16t CPU

A cluster produced up of:

  • 3x 28c/56t Intel Xeon Scalable Gold 6132
  • 3x 192TB RAM DDR4 ECC 2666MHz
  • 3x 14 SSD 480GB SATA Intel S4500 6Gbps

A Kubernetes cluster and a Hadoop cluster.


  • Income 1200 € / month
  • Cafe tickets
  • Transportation move
  • Participation in one particular international convention

In the previous, the conferences which we attended incorporate the KubeCon organized by the CNCF basis, the Open Source Summit from the Linux Basis and the Fosdem.

For any ask for for more information and to submit your application, be sure to get hold of David Worms:

Jennifer R. Kelley

Leave a Reply

Next Post

Microsoft Teams November 2022 Update added these new features

Sat Dec 3 , 2022
Microsoft is continuing its assist in creating MS Teams as the finest instrument for interaction and collaboration on-line. We have observed previously that the enterprise is routinely incorporating new characteristics to it, and at the stop of the month arrives a in-depth list of options included all through the thirty […]
Microsoft Teams November 2022 Update added these new features

You May Like