I'd say labels (for supervised ML) are fundamentally different from rules (for expert systems), because
- labels are easy to decide in many cases
- rules require humans to analyze patterns in the problem space
- labels concern only individual data points
- rules generalize over a class of data points
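
To make the contrast concrete, here's a toy sketch. The spam task, the example messages, and the `rule_free_offer` function are all made up for illustration; they don't come from any real system. The labels are just per-message decisions, while the rule encodes a pattern someone had to notice across many messages.

```python
from typing import Optional

# Hypothetical toy task: flag messages as spam.

# Labels: one decision per data point, made by looking at that point alone.
labeled_examples = [
    ("WIN a FREE prize now!!!", "spam"),
    ("Lunch at noon tomorrow?", "ham"),
    ("FREE entry, click here", "spam"),
]

# Rule: a human looked across many messages, spotted a pattern, and wrote it
# down. It covers every message that matches, including unseen ones.
def rule_free_offer(message: str) -> Optional[str]:
    """Return "spam" if the rule fires, else None (the rule has no opinion)."""
    if "free" in message.lower():
        return "spam"
    return None

unseen = "Get your FREE trial today"

# The labels by themselves say nothing about a message nobody labeled.
known_labels = dict(labeled_examples)
label_lookup = known_labels.get(unseen)

# The rule still gives a verdict, because it describes a whole class of messages.
rule_verdict = rule_free_offer(unseen)

print(label_lookup, rule_verdict)
```

In supervised ML the generalization step is left to the learning algorithm, which is why the labeler never has to do the pattern analysis the rule author did.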