What bit string is a character (or short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ?????????n}?????????n{^ 0011111100111111001111110011111100111111001111110011111100111111001111110110111001111101001111110011111100111111001111110011111100111111001111110011111100111111011011100111101101011110 3f3f3f3f3f3f3f3f3f6e7d3f3f3f3f3f3f3f3f3f6e7b5e
SJIS-WIN 彦????キ???n}彦????キ???n{^ 100101010100011000111111001111110011111100111111100000110100110000111111001111110011111101101110011111011001010101000110001111110011111100111111001111111000001101001100001111110011111100111111011011100111101101011110 95463f3f3f3f834c3f3f3f6e7d95463f3f3f3f834c3f3f3f6e7b5e
EUC-JP 彦????キ???n}彦????キ???n{^ 110010011010011100111111001111110011111100111111101001011010110100111111001111110011111101101110011111011100100110100111001111110011111100111111001111111010010110101101001111110011111100111111011011100111101101011110 c9a73f3f3f3fa5ad3f3f3f6e7dc9a73f3f3f3fa5ad3f3f3f6e7b5e
UTF-8 彦숉쓾溜믥キ溜뀀젣n}彦숉쓾溜믥キ溜뀀젣n{^ 1110010110111101101001101110110010001000100010011110110010010011101111101110111110100111100010111110101110101111101001011110001110000010101011011110111110100111100010111110101110000000100000001110110010100000101000110110111001111101111001011011110110100110111011001000100010001001111011001001001110111110111011111010011110001011111010111010111110100101111000111000001010101101111011111010011110001011111010111000000010000000111011001010000010100011011011100111101101011110 e5bda6ec8889ec93beefa78bebafa5e382adefa78beb8080eca0a36e7de5bda6ec8889ec93beefa78bebafa5e382adefa78beb8080eca0a36e7b5e
UHC 彦숉쓾溜믥キ溜뀀젣n}彦숉쓾溜믥キ溜뀀젣n{^ 1110010111101001100110011110110110011101100110011110101011111110100100101110011110101011101011011110101011111110101100101110101110100000100111000110111001111101111001011110100110011001111011011001110110011001111010101111111010010010111001111010101110101101111010101111111010110010111010111010000010011100011011100111101101011110 e5e999ed9d99eafe92e7abadeafeb2eba09c6e7de5e999ed9d99eafe92e7abadeafeb2eba09c6e7b5e

SJIS-Win, EUC-JP: Classic character sets used mainly for Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
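
The conversion shown in the table can be approximated offline. The following is a minimal sketch in Python, not the code behind this page. The mapping from the table's charset labels to Python codec names (for example SJIS-WIN to cp932 and UHC to cp949) is an assumption, as are the helper names bit_strings and conversion_table. Python's codec tables may differ from this page's for some characters. Characters that a charset cannot represent are replaced with "?" (0x3f), which matches the runs of 3f in the table.

```python
# Assumed mapping from the table's charset labels to Python codec names.
CODECS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def bit_strings(text, codec):
    """Return (binary, hexadecimal) strings for text encoded with codec."""
    # errors="replace" substitutes "?" (0x3f) for unencodable characters.
    data = text.encode(codec, errors="replace")
    # Each byte becomes eight binary digits, most significant bit first.
    binary = "".join(f"{byte:08b}" for byte in data)
    return binary, data.hex()


def conversion_table(text):
    """Encode text in every charset listed in CODECS (hypothetical helper)."""
    return {label: bit_strings(text, codec) for label, codec in CODECS.items()}


if __name__ == "__main__":
    for label, (binary, hexadecimal) in conversion_table("彦キ").items():
        print(label, binary, hexadecimal)
```

Each row of the result has the same shape as a table row above: the charset label, the binary bit string, and the same bytes in hexadecimal.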