@SmaineDev

afup

 @SmaineDev

🥋Trainer, Dev 🐛

🚀 www.smaine.me

     @SmaineDev

🖊  smaine-milianni.medium

      ismail1432

     smainemilianni

🍿 github.com/ismail1432/conferences

🐘👃👣☠️

@SmaineDev #forumphp

Le Guide du RUNNER en

1 Volume 👍🏽

 @SmaineDev
 @SmaineDev

LE RUN

Smaine Milianni

What is Observability? 🩺

 @SmaineDev

What is observability? 🩺

 @SmaineDev

📖  The observability is the ability to understand, monitor, and diagnose a system/software

 @SmaineDev

Why

 @SmaineDev
 @SmaineDev

Everything fails, all the time"

Werner Vogels, CTO Amazon.

Why?

💡To ensure your system is 💯 functional

 @SmaineDev

Why?

💡To ensure your system is 💯 functional

⚡️ Early detection of the problem(s)

 @SmaineDev

Why?

💡To ensure your system is 💯 functional

⚡️ Early detection of the problem(s)

🐌 Detect performance issue

 @SmaineDev

Why?

💡To ensure your system is 💯 functional

⚡️ Early detection of the problem(s)

🐌 Detect performance issues

 Catch weird behavior(s)

 @SmaineDev

Why?

💡To ensure your system is 💯 functional

⚡️ Early detection of the problem(s)

🐌 Detect performance issues

 Catch weird behavior(s)

🔓 Minimize the risk of downtime

 @SmaineDev

Why?

💡To ensure your system is 💯 functional

⚡️ Early detection of the problem(s)

🐌 Detect performance issues

 Catch weird behavior(s)

🔓 Minimize the risk of downtime

⏳Reduce the time of downtime

 @SmaineDev

Why❓

💡To ensure your system is 💯 functional +

⚡️ Early detection of the problem(s) +

🐌 Detect performance issues +

Catch weird behavior(s) +

🔓 Minimize the risk of downtime +

⏳Reduce the time of downtime

=

❤️ Client Happiness 😍

 @SmaineDev

How do we do observability? 🩺

📝 Logs

 @SmaineDev

How do we do observability? 🩺

📝 Logs

👀 Metrics

 @SmaineDev

How do we do observability? 🩺

📝 Logs

👀 Metrics

👣 Traces

 @SmaineDev

The 3️⃣ Pillars of Observability

 @SmaineDev

👣Traces

Metrics📊

📝Logs

How do we do observability? 🩺

📝 Logs

👀 Metrics

👣 Traces

🚨 Alerts

 @SmaineDev

📝 Logs

Record activities, and events of a system.

Help to debug and detect anomalies.

 @SmaineDev

📝 Logs

 @SmaineDev

👀 Metrics

Help to track performance and analysis of the system.

Observe the health of your application

 @SmaineDev

👀 Metrics

 @SmaineDev

👣 Traces

Provide insight into the flow of the application,

A trace represents the entire journey of a request or action as it moves through all the nodes of a distributed system.

 @SmaineDev

👣 Traces

 @SmaineDev
 @SmaineDev

Who Is Responsible for Observability?👮‍♂️

 @SmaineDev
 @SmaineDev

OBSERVABILITY

 @SmaineDev
 @SmaineDev

build it

run it

YOU

YOUSIGN

 @SmaineDev

12 000 clients 😍

4 million signatures 🔥

 @SmaineDev

🏉 Organization 🏠

Squad

Squad

👤 7 squads

👮‍♂️ A Squad is responsible for a dedicated perimeter

🔐 Each squad has a round-robin runner and a release operator

Squad

Squad

 @SmaineDev
 @SmaineDev

🛠️ Everything related to the tech stack

🔀 CI/CD, Tools

🦸🏿‍♂️ Developer assistant

🤝 Bridge between SRE and Dev

✅ Release 

 @SmaineDev

👨🏽‍🚒 The "Runner"*

👀 Monitor

🗣️ Communicate

⚒️ Handle hotfixes

🖊️ Write Postmortem

*On Call Engineer

 @SmaineDev

My 1st day as Runner

 @SmaineDev

Who can be a Runner❓

 @SmaineDev

Who can be a Runner❓

🫵 Anyone who wants and who is experienced in the technical stack and the tools used.

👥 Pair RUN for the WIN

 @SmaineDev

Qualities of a Runner 👑

🧮  Analysis

👌 Self-control

🗣️ Communication

 @SmaineDev

🐘👃👣☠️

 @SmaineDev

What is pleasant about being a Runner

🍄Increase soft and hard skills

Make you more conscientious about your delivery 👔

🤝 Exchange with other teams

🐘👃👣☠️

 @SmaineDev

🧠 Hard to focus on your tasks

😵 Fatigue and stress

What is unpleasant about being a Runner

 @SmaineDev
 @SmaineDev
 @SmaineDev

How an error is detected

📊 Monitoring

🗣️ Human report

🚨 Alert

 @SmaineDev

🚨 Alert

🤖 Automated Alert

 @SmaineDev

🚨 Alert

🔥 War Room

 @SmaineDev

🚨 Alert

 Status Page 📃

🔥 War Room

https://yousign.statuspage.io

 @SmaineDev

👷🏽‍♂️ Resolution

 @SmaineDev

🚨 Alert

 Status Page 📃

🔥 War Room

👷🏽‍♂️

Resolution

 Status Page 📃

Incident Workflow

 @SmaineDev

🚨 Alert

 Status Page 📃

PostMortem 💀

🔥 War Room

👷🏽‍♂️

Resolution

 Status Page 📃

Incident Workflow

 @SmaineDev

PostMortem 💀

"Learning from failures"

🤔 What happened

🔨 Root cause

⏱️ Timeline

💨 Detection

🧯 How it was fixed

🎓 Learning lessons

🐣 Next step(s)

....

 @SmaineDev

PostMortem 💀

PostMortem

follow up

actions 🔧

IMPROVE WHAT WAS WRONG, AVOID TO REPRODUCE

⚒️ TOOLS

Opsgenie

Sentry

Graylog

Slack

Discord

Redash

Datadog

 @SmaineDev

Anticipation is the 🗝️

 @SmaineDev

Anticipation 🗝️

📊 Create/Improve your own dashboard(s)

⏰ Create your own alerts

🧯What can help you before/if things go wrong

🖋️ Write logs

📑 Documentation

🥷 Process

🧪 Experiment

 @SmaineDev

👍🏽 https://openfeedback.io/forumphp2023/2023-10-13

🍿 https://github.com/ismail1432/conferences

 @SmaineDev