Hacker Newsnew | past | comments | ask | show | jobs | submit | dacm's commentslogin

Very interesting take to counteract the 23andme data leak


Forcing you to give consent to receive a newsletter in order to download the ebook is not the way to go


Nobody is forcing you. You don't have to fill the form if you don't want. Exchanging the book for your email seems a fair deal to me.


Do you expect them to get nothing in return?


That's the traditional definition of "free" (as in beer).


We've been packaging pandas in a lambda which is used to perform some calculations, but being a 50 MB zip file makes cold starts of about 6-8 secs. We're lucky that the service has little use, thus our way to workaround it is by having a lambda warmer which is run every 5 minutes and invokes N pandas lambdas. I'd be very interested in knowing if Layers has some feature to avoid this kind of issue.


We had the same cold start problem and couldn’t find a way to reliably keep things warm. For instance, Lambda would often spawn a new instance rather than re-use the warm one.

In the end, we came to the conclusion that Amazon is smart and won’t let you hack together the equivalent of a cheaper EC2.


I don't think it's deliberately so, just that developing a solution requires scheduling and routing to cooperate. Normally they're considered by separate systems. As your execution pool expands, this problem becomes worse, not better.

On the other hand, their incentive to solve the problem is relatively weak vs an on-premise alternative.


If I were doing this today, I would prototype the problem in Python and after realising the startup penalty, would rewrite it in D's Mir [1] or Nim's ArrayMancer [2].

Life on a lambda is too short to pay 6-8 second startup penalty over and over millions of time.

[1]: https://github.com/libmir/mir-algorithm [2]: https://mratsim.github.io/Arraymancer/


Our problem is that we have a team of data scientists who are familiar with Python, plus a decent set of custom tools written in it, so changing languages isn't an option


that's often the current explanation for continued use of Pyhton and R.

Often it is a sign that the problem is not "big" enough (eg: not crunching truly large data sets) OR data science team gets disproportionate amount of goodwill (thus money) to spend on its foibles. :)


How did you get the zip down to 50MB. I was under the impression that pandas+numpy was closer to 300MB and bumped up against AWS size limits. I was considering building some hacked together thing with S3

I came to this thread specifically to find out about numpy and pandas on lambda.


We've been running a stripped down version of numpy + scipy + matplotlib in lambda. We'd build the environment in a docker container with Amazon linux, manually remove unneeded shared objects and then rezip the local environment before uploading to s3.

A similar method is described here: https://serverlesscode.com/post/deploy-scikitlearn-on-lamba/

Layers should make this entire process easier.


When I worked on this I used this article as a reference: https://serverless.com/blog/serverless-python-packaging/ and also ended up with a huge image. What that article didn't mention is that the virtual environment folder should be excluded in the serverless config file, as the runtime is provided by boto3. So adding:

package: exclude: - venv/

would reduce the size considerably (to 50 MB in my case)


Why though? Is it cheaper than just running a bunch of servers?


It is in our case. This is a service which is very seasonal, so it may be used during a couple of days each month only. Having a bunch of instances mostly idle would definitely be more expensive


How much delay from a cold start can your application tolerate? On the order of tenths of a second or up to one second?


Being that the data is queried from a web app through HTTP, the shorter the better. Around 1 sec should be alright, but 6 - 8 definitely isn't


From one of the video comments: "They (Maria João, conductor, and orchestra) had recorded 6 months earlier three Mozart concerti. This lunch concert was the rehearsal for the evening performance. Since they had rehearsed to prepare the recording six months earlier, this lunch concert was the only rehearsal. She came prepared to play K. 467 and was caught off guard by K. 466 (although it has been in her repertory for years)."


My current top three would be:

uBlock Origin - Block annoying ads

Link: https://chrome.google.com/webstore/detail/ublock-origin/cjpa...

LastPass - Easily manage passwords

Link: https://chrome.google.com/webstore/detail/lastpass-free-pass...

The Great Suspender - Save RAM by suspending unused tabs

Link: https://chrome.google.com/webstore/detail/the-great-suspende...


I've found OneTab to be a better idea than Suspender personally, plus it lets me look back at all the stuff I did have open during a session for quick recall.


Similar list here, but I'd add Ghostery and Google Hangouts to the list. Along with React and Redux dev tools if you use them.


Use PrivacyBadger, not Ghostery. Ghostery let's some stuff through.


Will take a look, though some stuff I want through (mainly disqus, etc).


LastPass is known to have... "problems".


It isn't great, and they did have a leak, but it is the best option if you're working across platforms (including your phone). And peering options are at least there. I've considered switching out, but have been relatively happy.



Website is a complete mess and the page you refer to was probably about the previous model which was withdrawn. Anyway, after some search I managed to find it here in Spain and even order one (they have a nice discount these days due to Black Friday). This one should make it for you: http://www.dell.com/de/unternehmen/p/xps-13-9360-laptop/pd?o...



Should be noted in the title that this post is from 2012


I'm in a similar situation to yours and been reading some positive reviews about this https://slimbook.es/

They seem to be very responsive in terms of doubts, questions, etc. but as most small companies their main disadvantage is brand awareness when compared to Dell or Apple. But of course this is greatly compensated by a cheaper price.


That looks like a MacBook Air clone.


Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: