What bit string is a character (or several characters) encoded to in each character set?

Enter one character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ?????????[?????????[^ 001111110011111100111111001111110011111100111111001111110011111100111111010110110011111100111111001111110011111100111111001111110011111100111111001111110101101101011110 3f3f3f3f3f3f3f3f3f5b3f3f3f3f3f3f3f3f3f5b5e
SJIS-WIN 豫ε?五??仰??[豫ε?五??仰??[^ 1001100010101100100000111100001100111111100011001101110000111111001111111000101111000010001111110011111101011011100110001010110010000011110000110011111110001100110111000011111100111111100010111100001000111111001111110101101101011110 98ac83c33f8cdc3f3f8bc23f3f5b98ac83c33f8cdc3f3f8bc23f3f5b5e
EUC-JP 豫ε?五??仰??[豫ε?五??仰??[^ 1101000010101110101001101100010100111111101110001101111000111111001111111011011011000100001111110011111101011011110100001010111010100110110001010011111110111000110111100011111100111111101101101100010000111111001111110101101101011110 d0aea6c53fb8de3f3fb6c43f3f5bd0aea6c53fb8de3f3fb6c43f3f5b5e
UTF-8 豫ε닜五낁뀯仰앲땯[豫ε닜五낁뀯仰앲땯[^ 11101000101100011010101111001110101101011110101110001011100111001110010010111010100101001110101110000010100000011110101110000000101011111110010010111011101100001110110010010101101100101110101110010101101011110101101111101000101100011010101111001110101101011110101110001011100111001110010010111010100101001110101110000010100000011110101110000000101011111110010010111011101100001110110010010101101100101110101110010101101011110101101101011110 e8b1abceb5eb8b9ce4ba94eb8281eb80afe4bbb0ec95b2eb95af5be8b1abceb5eb8b9ce4ba94eb8281eb80afe4bbb0ec95b2eb95af5b5e
UHC 豫ε닜五낁뀯仰앲땯[豫ε닜五낁뀯仰앲땯[^ 111001111110001110100101111001011000100010011101111001111110100110000101111010001000010110100101111001001110011010011101111010001000101110000101010110111110011111100011101001011110010110001000100111011110011111101001100001011110100010000101101001011110010011100110100111011110100010001011100001010101101101011110 e7e3a5e5889de7e985e885a5e4e69de88b855be7e3a5e5889de7e985e885a5e4e69de88b855b5e

SJIS-Win, EUC-JP: Classic character sets used mainly to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
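
The conversion shown in the table can be approximated offline. The following is a minimal Python sketch, not the code behind this page. It assumes Python's built-in codecs `cp932` and `cp949` correspond to SJIS-WIN and UHC, and that characters a charset cannot represent are replaced by "?" (byte 3f), as in the rows above. The names `CHARSETS`, `encode_row`, and `convert` are made up for illustration, and results for rare characters may differ from the page's own converter.

```python
# Map the table's charset labels to Python codec names.
# Assumption: cp932 stands in for SJIS-WIN and cp949 for UHC.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Return (binary string, hex string) for text encoded with codec."""
    # errors="replace" substitutes "?" for characters the codec lacks.
    data = text.encode(codec, errors="replace")
    binary = "".join(f"{byte:08b}" for byte in data)
    return binary, data.hex()


def convert(text):
    """Encode text in every charset listed in CHARSETS."""
    return {label: encode_row(text, codec) for label, codec in CHARSETS.items()}


if __name__ == "__main__":
    for label, (binary, hexadecimal) in convert("五").items():
        print(label, binary, hexadecimal)
```

For the single character 五, this sketch yields e4ba94 in UTF-8, 8cdc in SJIS-WIN, and b8de in EUC-JP, which agrees with the byte sequences for that character in the table above. ISO-8859-1 has no such character, so it falls back to 3f.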