What bit string is a character (or a short string) encoded to in each character set?

Enter one character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ???n}???n{^ 0011111100111111001111110110111001111101001111110011111100111111011011100111101101011110 3f3f3f6e7d3f3f3f6e7b5e
SJIS-WIN 鉞伉擴n}鉞伉擴n{^ 1110011111100110100110001100001010011101101100000110111001111101111001111110011010011000110000101001110110110000011011100111101101011110 e7e698c29db06e7de7e698c29db06e7b5e
EUC-JP 鉞伉擴n}鉞伉擴n{^ 1110111011101000110100001100010011011010101100100110111001111101111011101110100011010000110001001101101010110010011011100111101101011110 eee8d0c4dab26e7deee8d0c4dab26e7b5e
UTF-8 鉞伉擴n}鉞伉擴n{^ 1110100110001001100111101110010010111100100010011110011010010011101101000110111001111101111010011000100110011110111001001011110010001001111001101001001110110100011011100111101101011110 e9899ee4bc89e693b46e7de9899ee4bc89e693b46e7b5e
UHC 鉞伉擴n}鉞伉擴n{^ 1110101011000111111110011111001011111100101010100110111001111101111010101100011111111001111100101111110010101010011011100111101101011110 eac7f9f2fcaa6e7deac7f9f2fcaa6e7b5e

SJIS-WIN, EUC-JP: Legacy character sets mainly used to encode Japanese text, on Windows (SJIS-WIN = CP932) and UNIX (EUC-JP) respectively.
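
The conversion shown in the table can be approximated offline. The following is a minimal Python sketch, not the converter's actual implementation. The helper names `encode_bits` and `convert` are hypothetical, and the mapping from the table's charset labels to Python codec names (for example SJIS-WIN to `cp932` and UHC to `cp949`) is an assumption. Characters that a charset cannot represent are replaced with `?` (0x3f), which matches the ISO-8859-1 row above. Results for the legacy charsets may differ slightly between implementations.

```python
# Assumed mapping from the table's charset labels to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_bits(text, codec):
    """Return (binary string, hex string) for text encoded with codec.

    Unencodable characters are replaced with '?' rather than raising.
    """
    raw = text.encode(codec, errors="replace")
    binary = "".join(f"{byte:08b}" for byte in raw)  # 8 bits per byte
    return binary, raw.hex()


def convert(text):
    """Encode text in every charset listed in CHARSETS."""
    return {name: encode_bits(text, codec) for name, codec in CHARSETS.items()}


if __name__ == "__main__":
    for name, (binary, hexadecimal) in convert("n{^").items():
        print(name, binary, hexadecimal)
```

Because `n`, `{`, and `^` are ASCII characters, all five charsets in this sketch encode them to the same bytes (6e 7b 5e). The rows only diverge for non-ASCII input such as the kanji in the table.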