Can someone explain what's the InfluxData's market? Or how they make/plan to mak...

pauldix · on Oct 1, 2023

So it sounds like you're asking about our use cases. We have customers across almost every vertical. But what's most common are application and server monitoring, sensor data (industrial, rockets, satellites, etc.), and network monitoring.

Metrics is certainly one use case that people pay us for. With v3, we expect that real-time analytics and some more data warehousing types of use cases will become interesting. We always envisioned InfluxDB as a store for observational data of all kinds, not just metrics.

On how we make money, we sell our products. We have at this time:

- InfluxDB v1 Enterprise (a self-managed, clustered implementation of InfluxDB)

- InfluxDB v1 Cloud (Enterprise, but as as single-tenant managed service. We still run this for hundreds of customers)

- InfluxDB v2 Cloud (multi-tenant, usage based, we're running this for thousands of customers)

- InfluxDB v3 Serverless (multi-tenant, usage based v3)

- InfluxDB v3 Cloud Dedicated (single-tenant, resource based pricing)

- InfluxDB v3 Clustered (self-managed v3, clustered database)

We'll have single server versions in the future, but we're a bit off from that. Right now our focus is on continuing support for our v1 and v2 customers, and further developing our v3 products for new customers and customers that want to migrate over.

atombender · on Oct 1, 2023

Does InfluxDB support event-based data models? For example, imagine something simple an SQL database's query log. Each query is an event, with data such as latency, rows, block I/O and so on, and metadata such as the database, the full SQL query, and so on. This is the kind of use case where traditional "measurement"-based time series databases like Prometheus aren't a good fit, because you have huge column cardinality for the labels (one value per query). Meanwhile, more general-purpose columns databases like ClickHouse and BigQuery have no issues with this type of data.

pauldix · on Oct 1, 2023

InfluxDB v3 is built to handle this kind of data. It's a columnar database, using object storage and Parquet files for persistence.

reactordev · on Oct 1, 2023

There in lies the problem. With shifting priorities and shifting strategies I don’t see them actually knowing what their market is.

jeroenhd · on Oct 1, 2023

I see Influx pop up a lot in communities like Home Assistant for doing time series data collection. I'm using Postgres for that myself (not a great experience but I knew that when I was too lazy to find alternatives during setup) but people seem very pleased with its performance.

I imagine similar dashboard services that don't necessarily work well in Prometheus are a good market for these types of databases. Prometheus is nice, but I don't think it's suitable for all Influx use cases (and vice versa).

ilyt · on Oct 1, 2023

VictoriaMetrics all-in-one binary is honestly everything that I need in my home IoT things.

Prometheus-compatible interface for query, a bunch of ingest protocols, smaller memory usage than InfluxDB (v1, haven't tested v2 coz new language have less grafana support). Options to scale too

iworshipfaangs2 · on Oct 1, 2023

Do you know if there is any particular advantage of Influx over Prometheus for IOT stuff? I also have noticed that Influx is way more popular in that space, but I don’t know whether the reasons are technical or just social (more tutorials, more shared experience, etc).

46Bit · on Oct 1, 2023

Influx is a full time series database. It's less opinionated than Prometheus.

capableweb · on Oct 1, 2023

As is tradition, hosted service: https://www.influxdata.com/influxdb-pricing/

Basically either you manage it yourself, or you pay them to do Serverless/Dedicated/Clustered hosted setup for you.

influx · on Oct 1, 2023

They seem all over the place.

https://www.theregister.com/2023/07/11/influxdata_apologizes...

vb-8448 · on Oct 1, 2023

Yeah, I know, but there are a lot of competition in this field. Every cloud provider, Grafana and other players have their own managed Prometheus compatible solution.

cmrdporcupine · on Oct 1, 2023

This I guess is the mystery to me. Projects like this are great. But how are people funding them? (AKA how can I get my own paws on some investment $$ to work on cool database^Wother-systems-level tech? I'll even promise to try to make $$)

GauntletWizard · on Oct 1, 2023

I am about the biggest Prometheus stan that you can find, but I will not mock or denigrate influx db. They are a runner-up, but there is room in this market for runner-ups, and their feature set does match some that Prometheus is not good at.

ilyt · on Oct 1, 2023

I think they missed one thing: Nobody wants to get locked in with the runner-up.

If they on top of that provided some compatibility layer for Prometheus/PromQL, like few other competitors did, then the prospective enterprise client have warm fuzzies that if they don't like it they don't need to rewrite entirety of their stack to work with something else. People could also use the existing ecosystem and "just plug it in", even replacing Prometheus instances they might have.

In the end people want to ingest the metrics and display it in Grafana. They don't need another visualisation solution that has less support and documentation. They don't want to learn new weird query language that is simulatenously more verbose and less readable than PromQL or even influxQL from v1.

vb-8448 · on Oct 1, 2023

Don't get me wrong, I don't denigrate Influx DB. They have some interesting features that prometheus doesn't have, I just don't think that this features can lead to a mass adoption(and money).

Let's put this way: is there any killer feature that can ditch most of the Prometheus installations in their favor?

ilyt · on Oct 1, 2023

And it *has* to be feature, not just "it's faster", because they are (much) faster time-series databases that support PromQL and are near-drop-in to your infrastructure.

manicennui · on Oct 1, 2023

I think you'd find that Prometheus is way behind a number of solutions in the corporate world, including Graphite, DogStatsD, etc. Popularity on HN does not translate to real world popularity.

vb-8448 · on Oct 1, 2023

Most of the companies in the corporate world have more than a monitoring system, at least 3 in my experience: sysadmins have at least one, network guys another one and one for legacy platforms (mainframes, as400 and so on).

Plus, in the last years with the rise of Kubernetes, most corporates have at least one cluster with Prometheus monitoring it.

Always, base on my experience, Prometheus is very popular, but the adoption is not so wide due to its `oss nature`: CTOs want someone to blame when things go wrong.

hagen1778 · on Oct 1, 2023

That's only a matter of time when they hire younger engineers who are familiar with modern monitoring systems and eager to apply their knowledge in practice.

manicennui · on Oct 2, 2023

What is it that you think prometheus offers over other solutions? It is more likely that the younger engineer is going to learn that companies don't care about what is popular on HN.

hagen1778 · on Oct 2, 2023

> What is it that you think prometheus offers over other solutions?

I like Prometheus and think this is a great piece of software. But even if we won't go into actual details, Prometheus is baked in into Kubernetes monitoring [0]. That's the first monitoring system young engineers will meet with when learning k8s. Although, k8s and Prometheus are both CNCF projects which means both of them will be promoted in synergy with each other.

> It is more likely that the younger engineer is going to learn that companies don't care about what is popular on HN.

This is not what I think younger engineers do :)

[0] https://kubernetes.io/docs/tasks/debug/debug-cluster/resourc...

kiitos · on Oct 2, 2023

In the year of our lord 2023, Graphite, Statsd, etc. are anachronisms. Prometheus is wildly superior to those tools.

matsemann · on Oct 1, 2023

Prometheus only handle aggregated data, though. While with influx you can store the events themselves with labels etc. While Prometheus is often good enough for standard metrics, it is just things it can't handle.

hagen1778 · on Oct 1, 2023

> Prometheus only handle aggregated data, though.

That's not true. You're referring to pull-based approach for metrics collection. It has its tradeoffs (like fixed interval scraping), but has a lot of benefits too (like higher reliability). Check the following link [0] from VictoriaMetrics docs, which supports both push and pull approaches. Prometheus also gained push support this year, though.

However, the main difference between Prometheus-like systems (Thanos, Mimir, VictoriaMetrics) and more traditional DBs for time series like InfluxDB or TimescaleDB is that first are designed to reflect system's state, and last are designed to reflect system's events. That's the main difference in paradigm, data model, and query languages. There is a reason why PromQL is so easy in 99% of cases, and so complex and annoying when users want to express what they get used to in traditional databases.

I'm saying this because I went through creating a Grafana datasource for ClickHouse [1] and I felt how complicated it is to express a most straightforward PromQL query in SQL, and vice versa.

If you'd like to learn more about differences between common queries for plotting time series in PromQL and SQL see my talk here [2].

[0] https://docs.victoriametrics.com/keyConcepts.html#write-data

[1] https://grafana.com/grafana/plugins/vertamedia-clickhouse-da...

[2] https://youtu.be/_zORxrgLtec?t=835

ilyt · on Oct 1, 2023

AFAIK only difference is that in Influx a given row can have more than one value so say "interface traffic" would be

    interface_if_octets host=router,instance=eth0,rx=123584,tx=213956

while in prometheus it would be

    interface_if_octets host=router,instance=eth0,type=rx 123584
    interface_if_octets host=router,instance=eth0,type=tx 213956

which in theory yes it is more compact but it gave me more annoyances than advantages during querying

>While Prometheus is often good enough for standard metrics, it is just things it can't handle.

My experience is that just anything made to ingest and analyze logs ends up mediocre for metrics and vice versa. I don't think I've seen single product that did both well or efficiently. So I'd rather have good metrics and just use ELK/Graylog/whatever else for logs.

matsemann · on Oct 1, 2023

Not really the biggest difference, I feel. In Prometheus, you only have the data you scrape. So lets say you scrape every minute. Then all you have is a counter per pod increasing from some value to something else.

With influx you can save every event. So you know exactly when it happened, the unique labels for that event etc. It's a completely different paradigm.

So in Prometheus you have

  myCounter,pod=1,time=20:23,value=1000
  myCounter,pod=2,time=20:23,value=500
  myCounter,pod=1,time=20:24,value=1100
  myCounter,pod=2,time=20:24,value=700

so all you know is that some event happened 100 times on pod1 and 200 times on pod2 the last minute. But with influx you could have a row for every single event. Of course that explodes the query time in comparison, but allows you to do much more with the data if needed.

lakomen · on Oct 1, 2023

Influx is a wholesome solution where Prometheus is just the time series database

vb-8448 · on Oct 1, 2023

Maybe i miss something, but the version 3 is just the database, they abandoned the TICK stack some time ago.

justin_oaks · on Oct 1, 2023

And I thank them for it. I was using the TICK stack a few years ago and the switch to TIG (Telegraf, InfluDB, and Grafana) has been a breath of fresh air.

Telegraf and InfluxDB are solid, but Chronograf was behind Grafana in usability and features.

Kapacitor was pretty rough. The language it used was hard to write, the docs were barebones, somewhat confusing, and sometimes inaccurate. When I switched off of Kapacitor, the CPU usage on the server dropped significantly too. So I'm guessing Kapacitor wasn't too CPU friendly either.

Too · on Oct 2, 2023

Can you expand what you are missing?

Prometheus is not just the db itself, it’s the ecosystem around it. You’ve got service-discovery, alertmanager and basically every application in existence having a /metrics endpoint and some pre-made Grafana dashboard.