[subset] Add unit test for str de-dup Also move the implementation of some methods from the .cc to the .hh