Streamlining model uploads

raspberrycup · September 16, 2024, 2:53pm

Hello,

Since uploading large models (e.g., LLaMA) is time-consuming and inefficient, could you enable competitors to upload their assets once, with the assets being copied to a folder that is always mounted in the Docker test environment, for example, under ‘/assets’?

kwetstone · September 17, 2024, 9:38pm

Hi @raspberrycup – good question. Unfortunately we don’t have persistent storage for competitors associated with the code execution environment, so assets have to be mounted each time.

If you’re curious for more details – when a submission is made, our code execution Kubernetes cluster creates an inference pod specifically for that submission. All of the submission assets are mounted to that pod. The pod shuts down after the submission code is done running, and the submission assets aren’t stored elsewhere. This blog post has a longer explanation.

raspberrycup · September 17, 2024, 10:37pm

Hello @kwetstone, thank you for your reply and sharing the details about the competition infrastructure and workflows. What about the possibility of hosting commonly used models in a shared storage for all competitors, similar to Kaggle datasets? According to the blog post you already have some competitor-independent storage and a similar storage could be used for hosting common open models. This greatly enhances the platform’s user experience and their engagement.

kwetstone · September 20, 2024, 2:01pm

@raspberrycup I see your point – I know it’s frustrating waiting for the long uploads each time. It’s unlikely we’ll be able to make that change for the current competition, but we see your point and will consider changing the setup for future competitions. We tend to see many different approaches across solutions, so it would be difficult to determine which common open models to make available within the limits of our competition storage capacity.

If you think of anything else that we could to do improve the participant experience, please let us know!

cuongk14 · September 20, 2024, 2:39pm

@kwetstone I would like to clarify that even there is no Internet connection while executing our submission, we can still load/download model from HuggingFace. We do not need to include a very big model in our submission ?

kwetstone · September 20, 2024, 2:55pm

@cuongk14 There is no access to the internet while executing your submission, so your code will not be able to download models from HuggingFace. To use an open-source model, you’ll need to download the pretrained weights and include those weights in your submission. The benchmark blog post walks through how to do this.

wdong · October 9, 2024, 7:40pm

Since initializing the k8s environment is simply unzipping the zip file, it would be in principle possible to allow a submission to be initialized from another submission. That is, if a new submission refers to a previous submission, you simply unzip the previous zip, and then the new one, this way it’s possible for one to only overwrite part of an existing submission. Something like the “from” line of Dockerfiles.

Many times one just wants to fix a bug in the code or change the inference logic. Allowing uploading a delta would substantially save the server storage and network resources and make the competition more fun.

kwetstone · October 9, 2024, 8:19pm

Unfortunately previous submissions to not persist in storage anywhere after they are executed in a pod, so it would not be possible to initialize from a previous submission.

My recommendation would be test locally while making small tweaks and fixing bugs, which will be faster than submitting to the competition platform.

Topic		Replies	Views
API to upload Large Submissoin File from Cloud or Remote On Cloud N	1	434	December 7, 2021
Empty data folder on Docker and upload at 0% Youth Mental Health: Automated Abstraction	3	91	September 28, 2024
Submission runs fine locally but smoketest fials Youth Mental Health: Automated Abstraction	5	62	November 20, 2024
Submission problems SNOMED CT Entity Linking	1	160	February 16, 2024
Access to runtime code output and artifacts from evaluation stage Water Supply Forecast Rodeo	1	132	December 15, 2023

Streamlining model uploads

Related topics