I’m having trouble understanding how to run the bulk loader. I have a Python script that starts a new dgraph-zero and creates a schema. From there, I’m creating a volume. That is my current understanding of how I should be setting up for the bulk loader.
Finally, I’m launching the bulk loader with docker run. What is the proper format for that command, and am I missing anything?
Welcome to the community. That command looks correct. If there are no errors, then the posting files created in the ./out folder are ready to be put in place when you start your cluster. Have a read of this document; it should clear up the next steps: Initial Import (Bulk Loader) - Hypermode
Thank you for confirming. I keep getting an error when I run this command from a PowerShell command line. I tried looking up the issue, and most resources suggest that this line is the actual error:
2025/07/28 01:03:44 unlinkat //out: device or resource busy
Where would my most likely points of failure be? Would you have some insight?
The bulk loader creates and deletes a lot of tmp files. Docker on Windows seems to have issues when using “Host bind mounts”. Try using a docker volume instead:
docker volume create dgraph_out
docker run -v dgraph_out:/out <rest of commands>
Then you’ll have to copy the bulk loader output files (the posting files, etc.) from the docker volume to your host system.
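Since you are already driving things from a Python script, here is a minimal sketch of that copy step. It relies on the common pattern of mounting the volume into a throwaway container and pulling the files out with docker cp. The helper container name (dgraph_out_copy), the alpine image, and the function names are assumptions for illustration only, not anything Dgraph-specific, so adjust them to your setup.

```python
import subprocess


def copy_volume_commands(volume, dest_dir, helper="dgraph_out_copy", image="alpine"):
    """Build the docker commands that copy a named volume's contents to dest_dir.

    `helper` and `image` are illustrative defaults (assumptions), not required names.
    """
    return [
        # Create (but never start) a throwaway container with the volume mounted at /out.
        ["docker", "container", "create", "--name", helper, "-v", f"{volume}:/out", image],
        # docker cp works on a stopped container; the trailing "/." copies the
        # directory's contents rather than the directory itself.
        ["docker", "cp", f"{helper}:/out/.", dest_dir],
        # Remove the helper container; the volume itself is left untouched.
        ["docker", "rm", helper],
    ]


def copy_volume_to_host(volume, dest_dir):
    """Run the commands in order, stopping at the first one that fails."""
    for cmd in copy_volume_commands(volume, dest_dir):
        subprocess.run(cmd, check=True)
```

Calling copy_volume_to_host("dgraph_out", "./out") after the bulk loader finishes should leave the output files in ./out on the host. Because this is a one-off copy rather than a live bind mount, it should sidestep the busy-mount problem, but I have not verified that on your exact Windows setup.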
Side note: If you’re ultimately going to run the graph on a Linux box, you might consider just running the bulk loader on that system (where you can install the Dgraph CLI) and skip all the Windows quirks.
I see. Is there a loading limit for the bulk loader? I know the live loader tries not to go over 15MB per batch; I’m assuming that isn’t the case with the bulk loader?