Every time I reinstall torchani i get this error message about atoms being bytes instead of strings, this easily fixes this. #639

avanteijlingen · 2023-10-12T08:50:51Z

Error:

File ~\anaconda3\lib\site-packages\torchani\data_init_.py:164 in reenterable_iterable_factory
d['species'] = numpy.array([idx[s] for s in d['species']], dtype='i8')

File ~\anaconda3\lib\site-packages\torchani\data_init_.py:164 in
d['species'] = numpy.array([idx[s] for s in d['species']], dtype='i8')

KeyError: b'C'

The proposed fix will allow the program to work wether it parses the atomic labels as bytes or strings by using .decode() within a try-catch

fixes File ~\anaconda3\lib\site-packages\torchani\data\__init__.py:164 in reenterable_iterable_factory d['species'] = numpy.array([idx[s] for s in d['species']], dtype='i8') File ~\anaconda3\lib\site-packages\torchani\data\__init__.py:164 in <listcomp> d['species'] = numpy.array([idx[s] for s in d['species']], dtype='i8') KeyError: b'C'

backwards compatibility

sigmoid scaling between min and max values

yueyericardo · 2023-10-12T13:18:34Z

Hi, thanks for contributing to TorchANI!
Could I know how did you get the error KeyError: b'C' you mentioned?

avanteijlingen · 2023-10-12T16:25:55Z

SAMPLE.zip

When i make a HDF5 dataset and then load it into ANI it always finds the species table to contain the atoms as b'C', b'H' etc which then it doesnt recognise without doing .decode().

I make the HDF5 datasets always similar to this:

mol, E, C, S, F = [],[],[],[],[]

mol.append(HDF5_Dataset.create_group(groupname))

E.append(mol[-1].create_dataset("energies", (energies.shape[0],), dtype='float64'))
E[-1][()] = energies
C.append(mol[-1].create_dataset("coordinates", Conformers.shape, dtype='float64'))
C[-1][()] = Conformers

species = np.array(species.split(), dtype="<U2")
species = np.array(species, dtype = h5py.special_dtype(vlen=str) )

S.append(mol[-1].create_dataset("species", data=species))
atom_types = np.unique(np.hstack((atom_types, species)))

addition decoding of bytes elements

Consider charge

do not try to decode the charge value

changes for charged AEV

handle ase one molecule aev

This reverts commit 07be11c.

avanteijlingen added 2 commits October 12, 2023 08:47

Update __init__.py

53e7eb8

backwards compatibility

avanteijlingen requested review from yueyericardo, IgnacioJPickering and farhadrgh as code owners October 12, 2023 08:50

Update nn.py

5e48ea6

sigmoid scaling between min and max values

Update nn.py

ad92f8f

avanteijlingen added 21 commits October 17, 2023 16:48

Update __init__.py

fe98c39

addition decoding of bytes elements

Update README.md

2fa293e

Update .gitignore

848eadc

Update aev.py

6911d5a

Consider charge

Update _pyanitools.py

eaa31ac

do not try to decode the charge value

Update aev.py

3a9f32d

Update aev.py

84408d8

changes for charged AEV

Update aev.py

90d0696

handle ase one molecule aev

account for charge properly (repeat_interleave)

5e29316

blank lines

07be11c

Revert "blank lines"

6a6e680

This reverts commit 07be11c.

removed parts we dont need

429951d

trains to 2 datasets

fb55076

expand to any number of datasets

2e91727

removed resources

a1db30f

Fixed testing values measurements

007eb86

Update .gitignore

9298624

Update Logger.py

cb2712a

N outputs in atomic_energies

dd88a88

Create MakeJointDataset.py

d1bae39

test cubane

1e8d7db

avanteijlingen added 17 commits March 17, 2024 23:12

Create TrainingAnalysis.py

6440565

Update TrainTANY.py

8e2326e

Delete .clang-format

bc3dd4f

Update .gitignore

1ca0f7d

Update Logger.py

b1a7e2b

Update TrainTANY.py

990a06b

reference DFT results for cubane

f9971b1

Update TrainTANY.py

49a3f20

Update nn.py

27319a6

Update .gitignore

e331d28

Create Testing.py

332ba5f

Update Testing.py

a101393

Update TrainTANY.py

6c9db69

Update Logger.py

536e8b0

Create ANI.out

949e496

Update TrainingAnalysis.py

230252a

Update TrainTANY.py

9aa0dda

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Every time I reinstall torchani i get this error message about atoms being bytes instead of strings, this easily fixes this. #639

Every time I reinstall torchani i get this error message about atoms being bytes instead of strings, this easily fixes this. #639

avanteijlingen commented Oct 12, 2023

yueyericardo commented Oct 12, 2023

avanteijlingen commented Oct 12, 2023

Every time I reinstall torchani i get this error message about atoms being bytes instead of strings, this easily fixes this. #639

Are you sure you want to change the base?

Every time I reinstall torchani i get this error message about atoms being bytes instead of strings, this easily fixes this. #639

Conversation

avanteijlingen commented Oct 12, 2023

yueyericardo commented Oct 12, 2023

avanteijlingen commented Oct 12, 2023