To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????N}?????????N{^ 0011111100111111001111110011111100111111001111110011111100111111001111110100111001111101001111110011111100111111001111110011111100111111001111110011111100111111010011100111101101011110 3f3f3f3f3f3f3f3f3f4e7d3f3f3f3f3f3f3f3f3f4e7b5e
SJIS-WIN 獰??巍?????N}獰??巍?????N{^ 111000001101011000111111001111111001101111011001001111110011111100111111001111110011111101001110011111011110000011010110001111110011111110011011110110010011111100111111001111110011111100111111010011100111101101011110 e0d63f3f9bd93f3f3f3f3f4e7de0d63f3f9bd93f3f3f3f3f4e7b5e
EUC-JP 獰??巍?????N}獰??巍?????N{^ 111000001101100000111111001111111101011011011011001111110011111100111111001111110011111101001110011111011110000011011000001111110011111111010110110110110011111100111111001111110011111100111111010011100111101101011110 e0d83f3fd6db3f3f3f3f3f4e7de0d83f3fd6db3f3f3f3f3f4e7b5e
UTF-8 獰⑸젗巍띾젌廉띾젶N}獰⑸젗巍띾젌廉띾젶N{^ 1110011110001101101100001110001010010001101110001110110010100000100101111110010110110111100011011110101110011101101111101110110010100000100011001110111110100110101000101110101110011101101111101110110010100000101101100100111001111101111001111000110110110000111000101001000110111000111011001010000010010111111001011011011110001101111010111001110110111110111011001010000010001100111011111010011010100010111010111001110110111110111011001010000010110110010011100111101101011110 e78db0e291b8eca097e5b78deb9dbeeca08cefa6a2eb9dbeeca0b64e7de78db0e291b8eca097e5b78deb9dbeeca08cefa6a2eb9dbeeca0b64e7b5e
UHC 獰⑸젗巍띾젌廉띾젶N}獰⑸젗巍띾젌廉띾젶N{^ 1110011110111110101010011110101110100000100100111110100011100100100011011110101110100000100011011110011011110101100011011110101110100000101010100100111001111101111001111011111010101001111010111010000010010011111010001110010010001101111010111010000010001101111001101111010110001101111010111010000010101010010011100111101101011110 e7bea9eba093e8e48deba08de6f58deba0aa4e7de7bea9eba093e8e48deba08de6f58deba0aa4e7b5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)