Hacker Newsnew | past | comments | ask | show | jobs | submit | robz75's commentslogin

What are the pain points your are facing with data cleaning? How do you handle it for now?


Data cleaning depends on the problem domain.

Compare output from a spoctrometer (or spectrograph) vs. eliminating outliers from an almost linear process. One will wreck your data and the other is the only correct thing to do.

         *         
**** ****


Why? What's currently annoying about notebooks that you have to deal with compared to just directly going to users?


Ah, well, rereading your original post I realize now this isn't necessarily painful for me. Perhaps though, the annoying aspect is seeing others use proprietary excel spreadsheets without a data lake. Conway's Law?

Does VS here mean Visual Studio? I would not call myself a data engineer, I just play one at work sometimes. Many hats, yknow?


"the annoying aspect is seeing others use proprietary excel spreadsheets without a data lake" => what's painful about that?

VS = compared to, versus


Hah okay. I read VS different from vs. The pain, in part, is hidden functions, rarely ever inline documentation, difficult to reuse or repurpose, Windows-centric, etc.


Thanks for your feedback, I can understand that it's not enoyable to record a video.

But for now we will keep these steps and this process since it's important for our recruitment process.


Thanks for the feedback.

I can understand that it can feel like an investment to apply.

But for now those are important steps we need in our recruiting process.

Let me know if you have any other feedback :)


Evaboot | Refactoring Python Django Backend | Full-Time | REMOTE | https://evaboot.com/

Hi, we are just 2 co-founders and we bootstrapped to 2M€ ARR in 2.5 years.

Evaboot help sales teams create prospecting databases from LinkedIn.

Goals: • Refactoring the existing Python-Django backend codebase. • New features development after refactoring period.

Requirements: • Serious experience in Python, Django, Celery. • You have refactored a lot of spaghetti code before (and not just your own).

Bonus Points: • Experience with Heroku • Django REST framework • Scraping projects

Details, context and application right here:

https://twilight-barnacle-bc7.notion.site/The-worst-Python-D...

Don't be shy and don't hesitate to ask any question :)


I was curious about this position (the fact that you are upfront that you need to do a big refactor), but then the application form was very off-putting.

> Tell us about yourself in 3 KPIs, with a brief explanation and examples. (A KPI is a number that evaluate performance in a specific aspect.)

> Record and upload an unedited face-cam video (3 to 10 minutes) where you explain a problem you had this week and how you solved it on your own.


Thanks for the feedback. Indeed I don't want to get the wrong expectations & be as clear as possible. It will be a painful cleaning job at first (at least that's how refactoring is viewed by most).

And yes we do have a carefully crafted process. Those questions / requests are very meaningful for us.


Recording a video of yourself explaining something is just a non-starter for me. Maybe I'm just old and not part of the Instagram/TikTok generation, but I'd feel very self-conscious. I doubt I'm the only one.


I kinda like it, not a big deal and filters out most lazy applicants


[flagged]


Thanks for your brutally honest feedback. What makes you think it's shit-tier company? How do you evaluate that?


[flagged]


Why would this mean it's a lower tier company to you?


Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: