To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????????????^ 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN 烝?猿狡?弔?乙?烝?猿狡?弔?淫?^ 1110000001111110001111111000100110001110111000001100001000111111100100101010001000111111100010011011001100111111111000000111111000111111100010011000111011100000110000100011111110010010101000100011111110001000111110100011111101011110 e07e3f898ee0c23f92a23f89b33fe07e3f898ee0c23f92a23f88fa3f5e
EUC-JP 烝?猿狡?弔?乙?烝?猿狡?弔?淫?^ 1101111111011111001111111011000111101110111000001100010000111111110001001010010000111111101100101011010100111111110111111101111100111111101100011110111011100000110001000011111111000100101001000011111110110000111111000011111101011110 dfdf3fb1eee0c43fc4a43fb2b53fdfdf3fb1eee0c43fc4a43fb0fc3f5e
UTF-8 烝렒猿狡쨌弔렍乙렊烝렒猿狡쨌弔렍淫렢^ 11100111100000111001110111101011101000001001001011100111100011001011111111100111100010111010000111101100101010001000110011100101101111001001010011101011101000001000110111100100101110011001100111101011101000001000101011100111100000111001110111101011101000001001001011100111100011001011111111100111100010111010000111101100101010001000110011100101101111001001010011101011101000001000110111100110101101111010101111101011101000001010001001011110 e7839deba092e78cbfe78ba1eca88ce5bc94eba08de4b999eba08ae7839deba092e78cbfe78ba1eca88ce5bc94eba08de6b7abeba0a25e
UHC 烝렒猿狡쨌弔렍乙렊烝렒猿狡쨌弔렍淫렢^ 11110001111101101000111010100111111010101011101111001110111010101100001010110111111100001100000010001110101000111110101111100000100011101010000111110001111101101000111010100111111010101011101111001110111010101100001010110111111100001100000010001110101000111110101111100010100011101011001101011110 f1f68ea7eabbceeac2b7f0c08ea3ebe08ea1f1f68ea7eabbceeac2b7f0c08ea3ebe28eb35e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)