@SmaineDev
@SmaineDev
🍿 github.com/ismail1432/conferences
@SmaineDev
@SmaineDev
Smaine Milianni
@SmaineDev
@SmaineDev
@SmaineDev
@SmaineDev
@SmaineDev
Everything fails, all the time"
Werner Vogels, CTO Amazon.
💡To ensure your system is 💯 functional
@SmaineDev
💡To ensure your system is 💯 functional
⚡️ Early detection of the problem(s)
@SmaineDev
💡To ensure your system is 💯 functional
⚡️ Early detection of the problem(s)
🐌 Detect performance issue
@SmaineDev
💡To ensure your system is 💯 functional
⚡️ Early detection of the problem(s)
🐌 Detect performance issues
Catch weird behavior(s)
@SmaineDev
💡To ensure your system is 💯 functional
⚡️ Early detection of the problem(s)
🐌 Detect performance issues
Catch weird behavior(s)
🔓 Minimize the risk of downtime
@SmaineDev
💡To ensure your system is 💯 functional
⚡️ Early detection of the problem(s)
🐌 Detect performance issues
Catch weird behavior(s)
🔓 Minimize the risk of downtime
⏳Reduce the time of downtime
@SmaineDev
💡To ensure your system is 💯 functional +
⚡️ Early detection of the problem(s) +
🐌 Detect performance issues +
Catch weird behavior(s) +
🔓 Minimize the risk of downtime +
⏳Reduce the time of downtime
=
❤️ Client Happiness 😍
@SmaineDev
@SmaineDev
@SmaineDev
@SmaineDev
@SmaineDev
👣Traces
Metrics📊
📝Logs
@SmaineDev
Record activities, and events of a system.
Help to debug and detect anomalies.
@SmaineDev
@SmaineDev
Help to track performance and analysis of the system.
Observe the health of your application
@SmaineDev
@SmaineDev
Provide insight into the flow of the application,
A trace represents the entire journey of a request or action as it moves through all the nodes of a distributed system.
@SmaineDev
@SmaineDev
@SmaineDev
@SmaineDev
@SmaineDev
@SmaineDev
@SmaineDev
@SmaineDev
@SmaineDev
Squad
Squad
👤 7 squads
👮♂️ A Squad is responsible for a dedicated perimeter
🔐 Each squad has a round-robin runner and a release operator
Squad
Squad
@SmaineDev
@SmaineDev
@SmaineDev
*On Call Engineer
@SmaineDev
@SmaineDev
@SmaineDev
@SmaineDev
🧮 Analysis
👌 Self-control
🗣️ Communication
@SmaineDev
@SmaineDev
🍄Increase soft and hard skills
Make you more conscientious about your delivery 👔
🤝 Exchange with other teams
@SmaineDev
🧠 Hard to focus on your tasks
😵 Fatigue and stress
@SmaineDev
@SmaineDev
@SmaineDev
📊 Monitoring
🗣️ Human report
🚨 Alert
@SmaineDev
🚨 Alert
@SmaineDev
🚨 Alert
🔥 War Room
@SmaineDev
🚨 Alert
Status Page 📃
🔥 War Room
https://yousign.statuspage.io
@SmaineDev
@SmaineDev
🚨 Alert
Status Page 📃
🔥 War Room
👷🏽♂️
Resolution
Status Page 📃
@SmaineDev
🚨 Alert
Status Page 📃
PostMortem 💀
🔥 War Room
👷🏽♂️
Resolution
Status Page 📃
@SmaineDev
PostMortem 💀
🤔 What happened
🔨 Root cause
⏱️ Timeline
💨 Detection
🧯 How it was fixed
🎓 Learning lessons
🐣 Next step(s)
....
@SmaineDev
PostMortem 💀
PostMortem
follow up
actions 🔧
Opsgenie
Sentry
Graylog
Slack
Discord
Redash
Datadog
@SmaineDev
@SmaineDev
📊 Create/Improve your own dashboard(s)
⏰ Create your own alerts
🧯What can help you before/if things go wrong
🖋️ Write logs
📑 Documentation
🥷 Process
🧪 Experiment
@SmaineDev
👍🏽 https://openfeedback.io/forumphp2023/2023-10-13
🍿 https://github.com/ismail1432/conferences
@SmaineDev