To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????R?????^[?????R?????^[^ 001111110011111100111111001111110011111101010010001111110011111100111111001111110011111101011110010110110011111100111111001111110011111100111111010100100011111100111111001111110011111100111111010111100101101101011110 3f3f3f3f3f523f3f3f3f3f5e5b3f3f3f3f3f523f3f3f3f3f5e5b5e
SJIS-WIN 鳶??熬?R鳶??熬?^[鳶??熬?R鳶??熬?^[^ 1001001111001110001111110011111111100000100100100011111101010010100100111100111000111111001111111110000010010010001111110101111001011011100100111100111000111111001111111110000010010010001111110101001010010011110011100011111100111111111000001001001000111111010111100101101101011110 93ce3f3fe0923f5293ce3f3fe0923f5e5b93ce3f3fe0923f5293ce3f3fe0923f5e5b5e
EUC-JP 鳶??熬?R鳶??熬?^[鳶??熬?R鳶??熬?^[^ 1100011011010000001111110011111111011111111100100011111101010010110001101101000000111111001111111101111111110010001111110101111001011011110001101101000000111111001111111101111111110010001111110101001011000110110100000011111100111111110111111111001000111111010111100101101101011110 c6d03f3fdff23f52c6d03f3fdff23f5e5bc6d03f3fdff23f52c6d03f3fdff23f5e5b5e
UTF-8 鳶멩릫熬뻨R鳶멩릫熬뻨^[鳶멩릫熬뻨R鳶멩릫熬뻨^[^ 11101001101100111011011011101011101010011010100111101011101001101010101111100111100001101010110011101011101110111010100001010010111010011011001110110110111010111010100110101001111010111010011010101011111001111000011010101100111010111011101110101000010111100101101111101001101100111011011011101011101010011010100111101011101001101010101111100111100001101010110011101011101110111010100001010010111010011011001110110110111010111010100110101001111010111010011010101011111001111000011010101100111010111011101110101000010111100101101101011110 e9b3b6eba9a9eba6abe786acebbba852e9b3b6eba9a9eba6abe786acebbba85e5be9b3b6eba9a9eba6abe786acebbba852e9b3b6eba9a9eba6abe786acebbba85e5b5e
UHC 鳶멩릫熬뻨R鳶멩릫熬뻨^[鳶멩릫熬뻨R鳶멩릫熬뻨^[^ 1110011011101001101110001110011010010000100011011110100010100010100101100110111001010010111001101110100110111000111001101001000010001101111010001010001010010110011011100101111001011011111001101110100110111000111001101001000010001101111010001010001010010110011011100101001011100110111010011011100011100110100100001000110111101000101000101001011001101110010111100101101101011110 e6e9b8e6908de8a2966e52e6e9b8e6908de8a2966e5e5be6e9b8e6908de8a2966e52e6e9b8e6908de8a2966e5e5b5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)