To what bit string is a character (or string) encoded in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ?????????n}?????????n{^ 0011111100111111001111110011111100111111001111110011111100111111001111110110111001111101001111110011111100111111001111110011111100111111001111110011111100111111011011100111101101011110 3f3f3f3f3f3f3f3f3f6e7d3f3f3f3f3f3f3f3f3f6e7b5e
SJIS-WIN 閻??猶??幽??n}閻??猶??幽??n{^ 1110100010000101001111110011111110010111010100000011111100111111100101110100100000111111001111110110111001111101111010001000010100111111001111111001011101010000001111110011111110010111010010000011111100111111011011100111101101011110 e8853f3f97503f3f97483f3f6e7de8853f3f97503f3f97483f3f6e7b5e
EUC-JP 閻??猶??幽??n}閻??猶??幽??n{^ 1110111111100101001111110011111111001101101100010011111100111111110011011010100100111111001111110110111001111101111011111110010100111111001111111100110110110001001111110011111111001101101010010011111100111111011011100111101101011110 efe53f3fcdb13f3fcda93f3f6e7defe53f3fcdb13f3fcda93f3f6e7b5e
UTF-8 閻띻괜猶묌슆幽딅쥛n}閻띻괜猶묌슆幽딅쥛n{^ 1110100110010110101110111110101110011101101110111110101010110100100111001110011110001100101101101110101110101100100011001110110010001010100001101110010110111001101111011110101110010100100001011110110010100101100110110110111001111101111010011001011010111011111010111001110110111011111010101011010010011100111001111000110010110110111010111010110010001100111011001000101010000110111001011011100110111101111010111001010010000101111011001010010110011011011011100111101101011110 e996bbeb9dbbeab49ce78cb6ebac8cec8a86e5b9bdeb9485eca59b6e7de996bbeb9dbbeab49ce78cb6ebac8cec8a86e5b9bdeb9485eca59b6e7b5e
UHC 閻띻괜猶묌슆幽딅쥛n}閻띻괜猶묌슆幽딅쥛n{^ 1110011110100010100011011110101010110001101001101110101110100010100100011110100110011010100110001110101011101011100010101110101110100010100100000110111001111101111001111010001010001101111010101011000110100110111010111010001010010001111010011001101010011000111010101110101110001010111010111010001010010000011011100111101101011110 e7a28deab1a6eba291e99a98eaeb8aeba2906e7de7a28deab1a6eba291e99a98eaeb8aeba2906e7b5e

SJIS-Win, EUC-JP: Legacy character sets used mainly for encoding Japanese text, on Windows (SJIS-Win = CP932) and on UNIX (EUC-JP).
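
The conversion shown in the table can be approximated with a short Python sketch. It is not the converter's own implementation: the mapping from the table's charset labels to Python codec names (cp932 for SJIS-WIN, cp949 for UHC) and the function name encode_in_charsets are assumptions made for illustration. Characters that a charset cannot represent are replaced with "?" (0x3f), one per character, which may differ from how the tool itself substitutes them.

```python
# Assumed mapping from the table's charset labels to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_in_charsets(text):
    """Return {label: (binary string, hex string)} for each charset."""
    rows = {}
    for label, codec in CHARSETS.items():
        # errors="replace" substitutes "?" for unencodable characters.
        raw = text.encode(codec, errors="replace")
        bits = "".join(f"{byte:08b}" for byte in raw)
        rows[label] = (bits, raw.hex())
    return rows


if __name__ == "__main__":
    for label, (bits, hex_string) in encode_in_charsets("n{^").items():
        print(label, bits, hex_string)
```

For the ASCII input "n{^", every charset in the sketch yields the same bytes, 6e7b5e, which matches the trailing bytes of each row in the table.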