feat: (Catalog) Assets for simple rag application #642
How about the podman runtime? I'm wondering if we can test with podman, which is a low-hanging thing, to see how it works with this new prototype.
I have started working on the code and verified that the image, model and template commands are working fine. I am still trying out deployment, but wanted to start putting the artifacts up for review.
Cool, let us try with bare-minimum services just for a PoC and see how this goes.
Signed-off-by: Yussuf Shaikh <yussuf.shaikh1@ibm.com>
@mkumatag I have done deployments using architecture and individual services, reusing most of the existing code. The draft code is available at https://github.com/yussufsh/project-ai-services/tree/apis_reuse. I tested the application create, delete (no change), template, image and model flows. The info sub-command is pending and should work with validation of the info.md file and a small code change.
Signed-off-by: Yussuf Shaikh <yussuf.shaikh1@ibm.com>
I have updated the specs to remove |
Also, let us explore how we can use the same instruct service with different variants such as cpu, spyre, etc., passing the variant as a param to each service.
This will require a little more work, since we need to propagate the values from the bottom service to the top one, i.e. the chat service should know the value of the instruct cpu param.
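To make the propagation idea concrete, here is a rough sketch and not the actual implementation. It assumes a hypothetical in-memory spec layout where each service lists its `depends_on` services and its own `params`; the field names, the `resolve_params` helper and the `service.param` key format are all made up for illustration and are not taken from the specs in this PR.

```python
from typing import Dict, Optional


def resolve_params(
    services: Dict[str, dict],
    name: str,
    overrides: Optional[Dict[str, str]] = None,
) -> Dict[str, str]:
    """Collect the params visible to `name`: its own plus those of every
    transitive dependency, keyed as "<service>.<param>" (hypothetical format).
    Values in `overrides` replace the defaults declared in the spec."""
    overrides = overrides or {}
    resolved: Dict[str, str] = {}
    seen = set()

    def visit(svc: str) -> None:
        if svc in seen:  # guard against shared or cyclic dependencies
            return
        seen.add(svc)
        spec = services[svc]
        # Visit dependencies first so bottom-level params are collected too.
        for dep in spec.get("depends_on", []):
            visit(dep)
        for key, default in spec.get("params", {}).items():
            full_key = f"{svc}.{key}"
            resolved[full_key] = overrides.get(full_key, default)

    visit(name)
    return resolved


# Hypothetical specs: chat sits on top of instruct.
services = {
    "instruct": {"params": {"variant": "cpu"}},
    "chat": {"depends_on": ["instruct"], "params": {}},
}
chat_params = resolve_params(services, "chat", {"instruct.variant": "spyre"})
```

With something like this, the chat service would see `instruct.variant` without having to declare it itself, and a user could switch cpu to spyre in one place.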
Signed-off-by: Yussuf Shaikh <yussuf.shaikh1@ibm.com>
Just a query: now, without podTemplateExecutions, if a service such as digitization has multiple pods like digitize-service and postgres, you end up deploying both concurrently, right? I hope that is fine.
We should have some reconciliation logic, e.g. if digitization needs a DB connection to start, it should wait for it in a loop. If not, the whole service will run only when all the pods are up.
Ideally we need to find a way to control the placement of the pods, or wait inside the pod for the DB, or set enough restart counts, etc., the way we do it in an OpenShift-like env. In my opinion podTemplateExecutions will just add extra complexity that may not be required. We will revisit this later if needed :) For now, maybe not!
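A minimal sketch of the "wait for it in a loop" idea, under assumptions: the readiness check here is a plain TCP connect, and the helper names `tcp_ready` and `wait_until`, the timeouts and the `postgres:5432` address are all hypothetical, not something defined in this PR.

```python
import socket
import time
from typing import Callable


def tcp_ready(host: str, port: int, connect_timeout_s: float = 2.0) -> bool:
    """Return True if a TCP connection to host:port can be opened."""
    try:
        with socket.create_connection((host, port), timeout=connect_timeout_s):
            return True
    except OSError:
        return False


def wait_until(
    check: Callable[[], bool],
    timeout_s: float = 60.0,
    interval_s: float = 2.0,
) -> bool:
    """Poll `check` until it returns True or `timeout_s` elapses.
    Returns True on success, False on timeout."""
    deadline = time.monotonic() + timeout_s
    while True:
        if check():
            return True
        if time.monotonic() >= deadline:
            return False
        time.sleep(interval_s)


def wait_for_db(host: str = "postgres", port: int = 5432) -> bool:
    """Hypothetical startup hook: block until the DB port accepts connections."""
    return wait_until(lambda: tcp_ready(host, port))
```

A pod like digitize-service could call something like `wait_for_db()` before starting its main process and exit non-zero on `False`, so the runtime's restart policy takes over.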
IMO: let us merge the bare minimum and increment based on what we learn, otherwise we will just sit on these proposals and PRs forever :)
Ah good eyes... I see two dirs. The name was changed during implementation, not deleted afterward. Will fix this. |
Signed-off-by: Yussuf Shaikh <yussuf.shaikh1@ibm.com>
Merging this based on another approval. Resolved all the comments.
Please don't depend on the directory conventions; let us add any required details to the spec and use them.
Added two new directories as per #636
This may change in the future, but we are defining the specs for implementing Architectures and Services APIs.
Note: OpenShift runtime specs are not tested yet, just divided based on Bob's suggestion. We need to determine how to deploy charts for each service there.