This is the biggest moving cloud computing since Thomas went from Oracle to be the CEO of google cloud. Charlie was the heart and soul of AWS engineering and product for more than 20 years. He can be a game changer for Microsoft
Can you share more detail? in this pricing model, you only pay if you use the service in the last 30 days. for a company with 100 engineers most of which don't author dashboards this will be a few hundred USD a month for HA, Security, Ptaching, scaling - IMHO self managed engineering time will cost more
HA on Grafana isn't that difficult. The majority of Grafana is browser-side rendered data of queries which are just passed through Grafana to the data source. It's fairly light weight.
Security wise, as far as I know AWS isn't offering anything additional here.
The argument of "but you're paying to not have to worry about it!" that is a fair argument. But comments like this seem to purposely over-state the difficulty of running a not-super-complicated tool like Grafana in order to make "only $600/month, or $7.2k/year", sound more appealing.
I agree with this. I've been running a mission critical Grafana dashboard for a long time now. It took about 15 minutes to setup, and not once in 3+ years has there ever been any maintenance required.
Moving to a different server (or cloning for HA redundancy) takes maybe 5 minutes to copy the JSON and make a crontab entry. It's pretty much the lowest headache software imaginable.
Grafana is one of the easiest internal services to host.
They follow the 12-factor app principles, provide a ready-to-use docker image, support various auth providers ( my company uses GitHub), integrates well with AWS IAM, etc.
As someone else said here, you just have to use RDS to store the state.
Version 1.1.0 includes the upstream open source versions of Elasticsearch 7.1.1, Kibana 7.1.1, and the latest updates for alerting, SQL, security, performance analyzer, and Kibana plugins, as well as the SQL JDBC driver. You can find details on enhancements, bug fixes, and more in the release notes for each plugin in their respective GitHub repositories. See Open Distro’s version history table for previous releases.
We (The GCP Ops Management and Stackdriver teams) are working hard in multiple fronts to deliver innovation (Such as service graph highlighted in day 1 keynote at Next and GKE monitoring) and at the same time deliver first class scale and reliability. its a journey, but we have made a lots of improvements over the last 12 month, and will continue to raise the bar in the next 6-12 month. We have many very large customers as well as startups using Stackdriver as the core of their Ops and SRE command and control center. I can personally guarantee that our users getting great UX and reliability is top of mind for the entire team. I would appreciate it if you can flag to our team any time you see a page not found or any other experience in Stackdriver that you feel does not meet the bar - we listen and we will resolve bugs one by one to meet your expectations. we have an email list, bug tracker and a feature request forum all listed here https://cloud.google.com/stackdriver/docs/contact-us . you can also submit in context feedback from the Stackdriver and GCP consoles which will be reviewed by the team. finally, please feel free to DM me on Twitter @markcartertm . We care deeply and would love to hear and respond.
Debugging in production without impacting users is the type of magic that only Google SRE could do for a long time... nice to see Google brings this to everyone - and free is great price :)