I/O is hard to benchmark so it's often ignored since you can just scale up your disks. It's a common gotcha in the cloud. It's not a show stopper, but it blows up the savings you might be expecting.
What does data frames mean in this context? I'm used to them in spark or pandas but does this relate to something in how duckDB operates or is it something else?
At 50k USD / year revenue it would be impossible, probably a lot easier at 1m / year. Higher profile use cases are known, and companies tend to comply with licenses rather than pirate software. Just like now, you lose a lot of control as soon as your source is available.
Without any context of culture or country, just trying to be helpful: in my limited (<20 total interviews) experience, I would think about budget issues.
Meaning, what you ask for (or how expensive you are perceived, if you have that strong resumee) for the industry you apply, may be too different and leading to limited access.
Sometimes I feel junior people have it easier (I felt like I did, personally) since the expense in salary is pretty limited compared to either other roles or more senior people
Thanks! However... I am very junior right now lol (<1 YOE)
(I have applied to both competitive as well as more niche firms fwiw, I expect there have been stronger resumes I've "lost to". Though, my degree isn't a "common" one even though it's actually very suitable.)
It's not you. As a junior, there are going to be so many candidates for anything you feel qualified to apply for. It's a numbers game. IMHO, you should still tailor your resume to fit the position. I'd put together a too long resume with everything, and trim off irrelevant stuff so it's down to one page (maybe two, if that's the standard locally) of mostly relevant things for each position; when your resume does get looked at, you want it to pass muster.
Do your best to network. Think about the people you went to school with: who among them would you like to work with.
Every week, send 2-5 of them an email, remind them of what you did in school together, ask them how their summer/etc was, how are they doing at job hunting/if they like the job they found. If you don't mind looking a little desperate, in that email write something like I'm having a hard time getting interviews, have you found anything that works? If you don't wait for their reply... if they got hired, ask if their company is hiring; if they're still looking ask for tips.
Check in with your school's career center. Check in with your favorite professors.
Check in with your parents' friends and your friends' parents.
A personal connection is likely to get your resume looked at closely and not just ignored because there were 1000 applicants and 10 candidates seemed worth interviewing in the first 100, so they didn't look at the rest. It might not get you an interview, but it helps your chances; also, a personal connection might get a referral to an unrelated opening which is unlikely for an unconnected application. I would definitely send a friend's kid to another friend at a different company if I thought that was a potential match, but I wouldn't consider it for a resume that just came in.
It's been mentioned but I want to add that the original idea of the post (mid size VPS hosting apache spark) might be missing that spark is ideal for distributed and resilient work (if a node fails the framework is able to avoid losing that work).
If you don't need this features, specially the distributed one, going tall (single instance with high capacity, replicate when necessary) or going simpler (multiple servers but without spark coordinating the work) could be good options depending on your/the team's knowledge
If you have a decent GPU or a modern Mac, you can run something like LM Studio (https://lmstudio.ai/) or Ollama (https://ollama.com/), configure the tool such as Cursor to use those models you've downloaded (personally I use Zed.dev), and then everything happens straight on your computer. Responses will be somewhat slower and not as good as state-of-the-art models, but they still can be helpful.
No, he basically means thay companies will not allow LLMs on their own code, I think.
I work in a multinational conglomerate, and we got AI allowed ... 2-3 weeks ago. Before that it was basically banned unless you had gotten permission. We did have another gpt4 based AI in the browser available for a few months before that as well.
Configuring the machine with anything other than the default settings
A new machine with the default settings is a nightmare. The apap algorithm will spike your pressure up to 16 (which is too high for most people) one minute and drop down to 4 (which is too low for most) the next. At the high end is aerophagia and leaks, at the low end, your apnea events will still happen.
When I called my doctor and explained the machine made my sleep worse, her office said “we gave you a machine. It’s out of our hands. Would you like us to refer you to a psychologist? The problem must be in your head.” This is almost verbatim.
The equipment supplier wasn’t much better. They offered to do a “mask fitting” but it was mid 2021 and the woman didn’t even want to be in the same room as me for fear of covid.
So what should they have done? I dunno. Eventually a friend recommended a mask that worked for me. I learned to analyze the data reported by the machine and changed the settings myself to something that worked for me. It took a year, the worst year of my life. Tired and constantly waking up. Thank god for YouTube and OSCAR. knowing what I know now, I can (and have) help someone else do this process in a week or two, but I had nobody to even ask.
I think it would be a good case for handing off from the prescribing doctor to some kind of home health aid / respiratory therapist who can make house calls and help make adjustments at the home.
It needs extensive calibration, if you don't want it to either be ineffective, or wake up in the morning, with your stomach pumped full of air, like a balloon.
Recently I have been thinking about this, because I feel I have managed to become way more organized than I ever thought it was possible.
What is working for me right now is noting everything in a calendar so I cannot forget it or as TODO in a somewhat heavy personalized Obsidian configuration.
A few years ago (5-6 aprox) I started copying my older co-workers habits to see myself improve. Physical notebooks were soon discarded because I never remember where I wrote down things.
I used a TODO plugin in sublime which worked for several months, until I felt I needed screenshots so I moved to OneNote. After a while I became frustrated with not being able to customize it enough, so I started trying out different things. I saw a coworker using Obsidian, watched a couple long YouTube videos to learn how to customize, and I'm never going back.
My team this week told me they are impressed with how much info I write down and it was a very proud moment for me!
PyIceberg is nice but we had to drop it because it's behind Java API and it's unclear when it will match up, so depending on which features are needed I'd look it up