What bit string is a character (or short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ?????????????E???L 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110100010100111111001111110011111101001100 3f3f3f3f3f3f3f3f3f3f3f3f3f453f3f3f4c
SJIS-WIN 狸旦息綻狸旦息狸則俗狸旦捉E狸旦捉L 10010010010010111001001001010101100100011010011110010010010111011001001001001011100100100101010110010001101001111001001001001011100100011010010110010001101011011001001001001011100100100101010110010001101010000100010110010010010010111001001001010101100100011010100001001100 924b925591a7925d924b925591a7924b91a591ad924b925591a845924b925591a84c
EUC-JP 狸旦息綻狸旦息狸則俗狸旦捉E狸旦捉L 11000011101011001100001110110110110000101010100111000011101111101100001110101100110000111011011011000010101010011100001110101100110000101010011111000010101011111100001110101100110000111011011011000010101010100100010111000011101011001100001110110110110000101010101001001100 c3acc3b6c2a9c3bec3acc3b6c2a9c3acc2a7c2afc3acc3b6c2aa45c3acc3b6c2aa4c
UTF-8 狸旦息綻狸旦息狸則俗狸旦捉E狸旦捉L 1110011110001011101110001110011010010111101001101110011010000001101011111110011110110110101110111110011110001011101110001110011010010111101001101110011010000001101011111110011110001011101110001110010110001001100001111110010010111111100101111110011110001011101110001110011010010111101001101110011010001101100010010100010111100111100010111011100011100110100101111010011011100110100011011000100101001100 e78bb8e697a6e681afe7b6bbe78bb8e697a6e681afe78bb8e58987e4bf97e78bb8e697a6e68d8945e78bb8e697a6e68d894c
UHC 狸旦息綻狸旦息狸則俗狸旦捉E狸旦捉L 11010111111000011101001110101001111000111101001111110111101010101101011111100001110100111010100111100011110100111101011111100001111101101100111011100001110101001101011111100001110100111010100111110011101101010100010111010111111000011101001110101001111100111011010101001100 d7e1d3a9e3d3f7aad7e1d3a9e3d3d7e1f6cee1d4d7e1d3a9f3b545d7e1d3a9f3b54c

SJIS-WIN, EUC-JP: Legacy character sets mainly used to encode Japanese text on Windows (SJIS-WIN = CP932) and UNIX (EUC-JP).
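
The conversion shown in the table can be approximated with a short Python sketch. This is not the page's own implementation, which is not shown. The helper name `encode_rows` and the mapping from the table's charset labels to Python codec names (`cp932` for SJIS-WIN, `cp949` for UHC) are assumptions made for illustration, and Python's codecs may differ from the page's converter for some vendor-specific characters. Characters that a charset cannot represent are replaced with `?`, which matches the ISO-8859-1 row above.

```python
# Table label -> Python codec name (this mapping is an assumption of the sketch).
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_rows(text):
    """Return (label, binary string, hex string) for each charset."""
    rows = []
    for label, codec in CHARSETS.items():
        # errors="replace" substitutes '?' for unencodable characters.
        raw = text.encode(codec, errors="replace")
        bits = "".join(f"{byte:08b}" for byte in raw)  # 8 bits per byte
        rows.append((label, bits, raw.hex()))
    return rows


if __name__ == "__main__":
    for label, bits, hex_string in encode_rows("狸E"):
        print(label, bits, hex_string)
```

Each byte is rendered as exactly eight binary digits, so the binary column is always four times as long as the hexadecimal column.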