To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????? 0011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 淨?館?觸?寥舵??? 10011111110001000011111110001010110110010011111111100110010111000011111110011011100011001001000111000111001111110011111100111111 9fc43f8ad93fe65c3f9b8c91c73f3f3f
EUC-JP 淨?館?觸?寥舵??? 11011110110001100011111110110100110110110011111111101011101111010011111111010101111011001100001011001001001111110011111100111111 dec63fb4db3febbd3fd5ecc2c93f3f3f
UTF-8 淨곌館뤈觸亐寥舵퍘몲뮈 111001101011011110101000111010101011001110001100111010011010010010101000111010111010010010001000111010001010011110111000111001001011101010010000111001011010111110100101111010001000100010110101111011011000110110011000111010111010101010110010111010111010111010001000 e6b7a8eab38ce9a4a8eba488e8a7b8e4ba90e5afa5e888b5ed8d98ebaab2ebae88
UHC 淨곌館뤈觸亐寥舵퍘몲뮈 11101111111001001011000011101010110011101011110110001111101110001111010110111010111010101010011111101000111011111111011011101100101110111000111110111000111101011011100110111111 efe4b0eacebd8fb8f5baeaa7e8eff6ecbb8fb8f5b9bf

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)