To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????R????^[????R????^[^ 0011111100111111001111110011111101010010001111110011111100111111001111110101111001011011001111110011111100111111001111110101001000111111001111110011111100111111010111100101101101011110 3f3f3f3f523f3f3f3f5e5b3f3f3f3f523f3f3f3f5e5b5e
SJIS-WIN ?鴦頸?R?鴦頸?^[?鴦頸?R?鴦頸?^[^ 00111111111010011111000111101000111100100011111101010010001111111110100111110001111010001111001000111111010111100101101100111111111010011111000111101000111100100011111101010010001111111110100111110001111010001111001000111111010111100101101101011110 3fe9f1e8f23f523fe9f1e8f23f5e5b3fe9f1e8f23f523fe9f1e8f23f5e5b5e
EUC-JP ?鴦頸?R?鴦頸?^[?鴦頸?R?鴦頸?^[^ 00111111111100101111001111110000111101000011111101010010001111111111001011110011111100001111010000111111010111100101101100111111111100101111001111110000111101000011111101010010001111111111001011110011111100001111010000111111010111100101101101011110 3ff2f3f0f43f523ff2f3f0f43f5e5b3ff2f3f0f43f523ff2f3f0f43f5e5b5e
UTF-8 뤙鴦頸뫙R뤙鴦頸뫙^[뤙鴦頸뫙R뤙鴦頸뫙^[^ 11101011101001001001100111101001101101001010011011101001101000001011100011101011101010111001100101010010111010111010010010011001111010011011010010100110111010011010000010111000111010111010101110011001010111100101101111101011101001001001100111101001101101001010011011101001101000001011100011101011101010111001100101010010111010111010010010011001111010011011010010100110111010011010000010111000111010111010101110011001010111100101101101011110 eba499e9b4a6e9a0b8ebab9952eba499e9b4a6e9a0b8ebab995e5beba499e9b4a6e9a0b8ebab9952eba499e9b4a6e9a0b8ebab995e5b5e
UHC 뤙鴦頸뫙R뤙鴦頸뫙^[뤙鴦頸뫙R뤙鴦頸뫙^[^ 100011111100100011100100111011001100110011110010101110001111110101010010100011111100100011100100111011001100110011110010101110001111110101011110010110111000111111001000111001001110110011001100111100101011100011111101010100101000111111001000111001001110110011001100111100101011100011111101010111100101101101011110 8fc8e4ecccf2b8fd528fc8e4ecccf2b8fd5e5b8fc8e4ecccf2b8fd528fc8e4ecccf2b8fd5e5b5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)