To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ???H{????s???G???J???H{ 0011111100111111001111110100100001111011001111110011111100111111001111110111001100111111001111110011111101000111001111110011111100111111010010100011111100111111001111110100100001111011 3f3f3f487b3f3f3f3f733f3f3f473f3f3f4a3f3f3f487b
SJIS-WIN 狸旦捉H{狸旦息綻s狸旦捉G狸旦捉J狸旦捉H{ 100100100100101110010010010101011001000110101000010010000111101110010010010010111001001001010101100100011010011110010010010111010111001110010010010010111001001001010101100100011010100001000111100100100100101110010010010101011001000110101000010010101001001001001011100100100101010110010001101010000100100001111011 924b925591a8487b924b925591a7925d73924b925591a847924b925591a84a924b925591a8487b
EUC-JP 狸旦捉H{狸旦息綻s狸旦捉G狸旦捉J狸旦捉H{ 110000111010110011000011101101101100001010101010010010000111101111000011101011001100001110110110110000101010100111000011101111100111001111000011101011001100001110110110110000101010101001000111110000111010110011000011101101101100001010101010010010101100001110101100110000111011011011000010101010100100100001111011 c3acc3b6c2aa487bc3acc3b6c2a9c3be73c3acc3b6c2aa47c3acc3b6c2aa4ac3acc3b6c2aa487b
UTF-8 狸旦捉H{狸旦息綻s狸旦捉G狸旦捉J狸旦捉H{ 11100111100010111011100011100110100101111010011011100110100011011000100101001000011110111110011110001011101110001110011010010111101001101110011010000001101011111110011110110110101110110111001111100111100010111011100011100110100101111010011011100110100011011000100101000111111001111000101110111000111001101001011110100110111001101000110110001001010010101110011110001011101110001110011010010111101001101110011010001101100010010100100001111011 e78bb8e697a6e68d89487be78bb8e697a6e681afe7b6bb73e78bb8e697a6e68d8947e78bb8e697a6e68d894ae78bb8e697a6e68d89487b
UHC 狸旦捉H{狸旦息綻s狸旦捉G狸旦捉J狸旦捉H{ 110101111110000111010011101010011111001110110101010010000111101111010111111000011101001110101001111000111101001111110111101010100111001111010111111000011101001110101001111100111011010101000111110101111110000111010011101010011111001110110101010010101101011111100001110100111010100111110011101101010100100001111011 d7e1d3a9f3b5487bd7e1d3a9e3d3f7aa73d7e1d3a9f3b547d7e1d3a9f3b54ad7e1d3a9f3b5487b

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)