I tried to build an MCP server on AWS Lambda using the graphrag libraries. I used Lambda layers rather than Docker container images to keep cold-start times low. However, the dependencies graphrag requires are so large that they don't fit into Lambda layers and hit the size limits. spaCy alone is 500 MB, and llama-index-core pulls in a large number of additional dependencies. It would be great if retrieval could be made much more lightweight for environments like AWS Lambda.
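For anyone who wants to reproduce the size problem, here is a rough sketch of how the unpacked dependency footprint can be checked against Lambda's documented 250 MB limit for a function plus all of its layers (unzipped). The helper names (`dir_size_bytes`, `package_sizes`, `fits_in_lambda`) are made up for illustration and are not part of any library; the script only measures whatever directory it is pointed at.

```python
import os

# AWS Lambda's documented quota for the unzipped function plus all layers.
LAMBDA_UNZIPPED_LIMIT_BYTES = 250 * 1024 * 1024


def dir_size_bytes(path):
    """Return the total size in bytes of all regular files under path."""
    total = 0
    for root, _dirs, files in os.walk(path):
        for name in files:
            full = os.path.join(root, name)
            # Skip symlinks so targets are not double counted or dereferenced.
            if not os.path.islink(full):
                total += os.path.getsize(full)
    return total


def package_sizes(site_packages):
    """Map each top-level entry in site_packages to its size in bytes."""
    sizes = {}
    for entry in os.listdir(site_packages):
        full = os.path.join(site_packages, entry)
        if os.path.islink(full):
            continue
        if os.path.isdir(full):
            sizes[entry] = dir_size_bytes(full)
        else:
            sizes[entry] = os.path.getsize(full)
    return sizes


def fits_in_lambda(site_packages, limit=LAMBDA_UNZIPPED_LIMIT_BYTES):
    """Return (fits, total_bytes) for the dependency directory."""
    total = sum(package_sizes(site_packages).values())
    return total <= limit, total


def report(site_packages, top=10):
    """Print the largest entries and whether the total fits the limit."""
    sizes = package_sizes(site_packages)
    for name, size in sorted(sizes.items(), key=lambda kv: kv[1], reverse=True)[:top]:
        print(f"{size / 1024 / 1024:8.1f} MB  {name}")
    fits, total = fits_in_lambda(site_packages)
    print(f"total: {total / 1024 / 1024:.1f} MB, fits in Lambda: {fits}")
```

Usage, assuming the dependencies were installed into a local `python/` directory with `pip install --target python/ ...` (the directory layout Lambda expects for Python layers): call `report("python")` to list the largest packages and see whether the total stays under the limit.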