To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????h???????? 001111110011111100111111001111110011111100111111001111110011111100111111011010000011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f683f3f3f3f3f3f3f3f
SJIS-WIN 誰則袖棚他息狸足側h誰則袖棚他息狸足 1001001001001110100100011010010110010001101100111001001001001001100100011011110010010001101001111001001001001011100100011010101110010001101001000110100010010010010011101001000110100101100100011011001110010010010010011001000110111100100100011010011110010010010010111001000110101011 924e91a591b3924991bc91a7924b91ab91a468924e91a591b3924991bc91a7924b91ab
EUC-JP 誰則袖棚他息狸足側h誰則袖棚他息狸足 1100001110101111110000101010011111000010101101011100001110101010110000101011111011000010101010011100001110101100110000101010110111000010101001100110100011000011101011111100001010100111110000101011010111000011101010101100001010111110110000101010100111000011101011001100001010101101 c3afc2a7c2b5c3aac2bec2a9c3acc2adc2a668c3afc2a7c2b5c3aac2bec2a9c3acc2ad
UTF-8 誰則袖棚他息狸足側h誰則袖棚他息狸足 11101000101010101011000011100101100010011000011111101000101000101001011011100110101000111001101011100100101110111001011011100110100000011010111111100111100010111011100011101000101101101011001111100101100000011011010001101000111010001010101010110000111001011000100110000111111010001010001010010110111001101010001110011010111001001011101110010110111001101000000110101111111001111000101110111000111010001011011010110011 e8aab0e58987e8a296e6a39ae4bb96e681afe78bb8e8b6b3e581b468e8aab0e58987e8a296e6a39ae4bb96e681afe78bb8e8b6b3
UHC 誰則袖棚他息狸足側h誰則袖棚他息狸足 1110001011000001111101101100111011100010110000001101110111011100111101101110001011100011110100111101011111100001111100001110101111110110101100000110100011100010110000011111011011001110111000101100000011011101110111001111011011100010111000111101001111010111111000011111000011101011 e2c1f6cee2c0dddcf6e2e3d3d7e1f0ebf6b068e2c1f6cee2c0dddcf6e2e3d3d7e1f0eb

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)