That is definitely one bottle-neck, creating the labelled data-sets is very expensive, besides that there are a large number of privacy sensitive issues that need to be taken care of around the whole labeling process to ensure that patient confidentiality is maintained. Large datasets exist but in general are not available for research for that reason, only a few hospitals have made datasets available without restriction.