Site reliability engineering at Systematic

T3ch life Companies Apps

                                                    by Systematic
                                                                            September 15, 2021 at 7:28 PM
                    

Site reliability engineering at Systematic

Alexandru Dejanu, Site Reliability Engineer at Systematic: One of the most significant advantages of being an SRE at Systematic is that the team is technology agnostic, which means that I'm interacting with new frameworks frequently.

Systematic is an international software company with a Danish foundation. The development center in Romania started its activity in 2017 and currently has over 120 employees working on education, healthcare, and defence projects.

The Site Reliability Engineer role is crucial at Systematic since it enables the development teams to achieve better product reliability. Alexandru Dejanu, Site Reliability Engineer at Systematic, tells us about how is like being part of the Customer Operations team. Learn more from Alex and his experience as an SRE at Systematic.

apiVersion: apps/v1

kind: SRE

metadata:

? name: SYSTEMATIC

? labels:

? ??app: ALEX DEJANU

At Systematic, I fully embraced the Site Reliability Engineering role, a pretty new paradigm in the IT field (especially in Romania) whose goal is to improve the reliability of systems in production.

Before onboarding the SRE journey, I worked as a DevOps. My main focus was to bridge the gap between development and operation teams by?enabling CI/CD and automating different processes. Still, here at Systematic, I discovered that a new challenge lies in front of me.

Taking a step forward as an SRE, I've understood some of the main responsibilities of this position by helping both development and operation teams to have full visibility to the complete application lifecycle. Here, I am focused on reducing toil and ensuring the applications' availability while also establishing and monitoring service-level metrics.

Three main categories of activities that a Site Reliability Engineer does at Systematic

Now I am part of the Customer Operations department. I'm working in a multi-project squad,?which means that we serve multiple teams, encompassing various industry sectors such as library and learning, healthcare, defence, renewables.

The tech stack is quite diverse, meaning we're working with Kubernetes, Openshift, Azure, Ansible, Grafana, Prometheus, and so forth.Given the vast industries and the technology stack, I can say that no two days are the same.

From a high-level perspective, the main activities are focused around observability (not to be confused with monitoring in which you are handling "predictable" failures, whereas observability provides a way to infer the state of a system), incident response (e.g., postmortems). Last but not least, another big part of the tasks is implementing POC's, capacity management, and incident management.

A day in a life of an SRE and recurrent tasks

Recurrent it's quite a strong word. There aren't intrinsically?recurrent tasks. We're using the Feature Driven Developmentprocess, which is oriented towards speed and efficiency.

One day you could implement a new Prometheus exporter, and the next day you could measure the cost allocation for a K8s cluster. Grafana dashboards are for sure one of our "golden hammers," and at some point, some investigation tasks will require juggling between Lucene's query syntax and PromQL.

But at the end of the day, an essential detail is taking the DevOps mindset a step forward. I wholeheartedly can say that all the daily tasks aim to achieve better product reliability.And when the team's main values are collaboration and progress, we are confident that this goal will be reached.

The challenging part is finding new ways to measure service reliability while proactively monitoring and optimizing workflows.

?Also, one key detail of this role is understanding the importance of Service-Level Objectives,?Agreements and?Indicators. I would say that they're a direct measurement of a service's behavior.

Keeping up to date with the SRE key trends

One of the most significant advantages of being an SRE at Systematic is that the team is technology agnostic, which means that I'm interacting with new frameworks quite frequently. In one project, you could work in a setup consisting of Terraform with Azure and the other Ansible with Openshift.

I tend to read different articles and blog posts like?RedHat. I'm also part of various communities such as StackOverflow, GitKraken which certainly helps with being up to date on multiple topics. Sometimes I'm giving my two cents on different subjects on platforms such as Medium and StackOverflow. Here you can read more about my opinionated views regarding some tech topics I'm interested in:

https://dejanualexandru.medium.com/

https://stackoverflow.com/users/7013263/dejdej

Find out more about Systematic HERE.

Since you scrolled down here
lets enjoy this a bit more!

Blind peek another awesome story

Share this one

Loginro cookies

are very useful and perform various functions that improves your experience with us. You could say that cookies are like "memories" about you, helping browsers remember how you navigated and the choices you made along the way. You can navigate easier and faster on a site that remembers you than on one that doesn't recognizes you. That's why most sites you know and like use cookies.

Settings Accept all

Accept selection

Cookie details

These cookies are set via embedded youtube-videos. They register anonymous statistical data on for example how many times the video is displayed and what settings are used for playback. No sensitive data is collected unless you log in to your google account, in that case your choices are linked with your account, for example if you click “like” on a video..

Name & Provider	Purpose	Expiry
__Secure-3PAPISID, __Secure-3PSID, __Secure-3PSIDCC, APISID google.com	Used to create a user profile and display relevant and personalised Google Ads to the user.	2 years
SID, SSID, HSID,SIDCC Google Analytics	Security cookie to protect users data from unauthorised access.	2 years
NID, CONSENT, NID, DV, UULE, Conversion Google Analytics	Stores visitor preferences and personalises ads on Google sites based on recent searches and interactions.	6 months
CONSENT Google Analytics	Stores visitors’ preferences and personalizes ads.	Persistent

We use a 3rd party analytical software to gather statistical information about our website visitors. These plugins may share content you provide to 3rd party. We recommend you read their privacy policies. A unique text string is saved to identify browser, timestamp for interactions and the browser/source page that led the user to our website. No sensitive information is saved. For more information, read the general Google Privacy policy, analytics.

Name & Provider	Purpose	Expiry
_ga Google Analytics	Registers a unique ID that is used to generate statistical data on how the visitor uses the website.	2 years
_gid Google Analytics	Helps counting and tracking pageviews.	1 day
_gat Google Analytics	Used to throttle request rate, which means that it limits the collection of data on high traffic sites.	session

We use plugins to ensure security, stability and performance of this website. These tools are necessary for us to serve you this website.

Name & Provider	Purpose	Expiry
PHPSESSID Loginro	Preserves user session state across page requests.	Session
REMEMBERME Loginro	If you click to “remember me” when logging in, you get to authenticate in an unauthenticated session.	3 weeks
device_view Loginro	Responsive content helps you experience Loginro properly on every device.	2 weeks
_GRECAPTCHA, _grecaptcha Humanity check	Loginro forms need to be filled by humans and these cookies ensures the operation of reCAPTCHA.	6 months
rc::a, rc::b, rc::c Humanity check	Used to distinguish between humans and bots.	6 months