To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????????????B 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101000010 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f42
SJIS-WIN ?冀け?冀??冀??冀け?冀??冀?B 001111111001100101100010100000101010111100111111100110010110001000111111001111111001100101100010001111110011111110011001011000101000001010101111001111111001100101100010001111110011111110011001011000100011111101000010 3f996282af3f99623f3f99623f3f996282af3f99623f3f99623f42
EUC-JP ?冀け?冀ʼn?冀ʼn?冀け?冀ʼn?冀ʼnB 0011111111010001110000111010010010110001001111111101000111000011100011111010100111001010001111111101000111000011100011111010100111001010001111111101000111000011101001001011000100111111110100011100001110001111101010011100101000111111110100011100001110001111101010011100101001000010 3fd1c3a4b13fd1c38fa9ca3fd1c38fa9ca3fd1c3a4b13fd1c38fa9ca3fd1c38fa9ca42
UTF-8 룵冀け룵冀ʼn룶冀ʼn룵冀け룵冀ʼn룶冀ʼnB 111010111010001110110101111001011000011010000000111000111000000110010001111010111010001110110101111001011000011010000000110001011000100111101011101000111011011011100101100001101000000011000101100010011110101110100011101101011110010110000110100000001110001110000001100100011110101110100011101101011110010110000110100000001100010110001001111010111010001110110110111001011000011010000000110001011000100101000010 eba3b5e58680e38191eba3b5e58680c589eba3b6e58680c589eba3b5e58680e38191eba3b5e58680c589eba3b6e58680c58942
UHC 룵冀け룵冀ʼn룶冀ʼn룵冀け룵冀ʼn룶冀ʼnB 10001111101010101101000011101101101010101011000110001111101010101101000011101101101010011011000010001111101010111101000011101101101010011011000010001111101010101101000011101101101010101011000110001111101010101101000011101101101010011011000010001111101010111101000011101101101010011011000001000010 8faad0edaab18faad0eda9b08fabd0eda9b08faad0edaab18faad0eda9b08fabd0eda9b042

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)