Focus stacking will only solve the DoF problem.
Not the i'm floating somewhere above, looking down on reality problem.
They used to use boroscopes with very short focal length lenses, move them along models at the proper level and the proper distance to the model, to create more or less convincing images. That really is something you should pursue, if (!) you're aim is to make the images look like they are of real-life-scale thingies, and not a model.
(Used to use, because now everything that used to be done with models is modelled in computer graphics).

