To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 癲ル?遊??儀??巍ル???┠釉??? 111000011001111110000011100010110011111110010111010101100011111100111111100010110101011000111111001111111001101111011001100000111000101100111111001111110011111110000100101101011110011111010110001111110011111100111111 e19f838b3f97563f3f8b563f3f9bd9838b3f3f3f84b5e7d63f3f3f
EUC-JP 癲ル?遊??儀??巍ル???┠釉??? 111000101010000110100101111010110011111111001101101101110011111100111111101101011011011100111111001111111101011011011011101001011110101100111111001111110011111110101000101101111110111011011000001111110011111100111111 e2a1a5eb3fcdb73f3fb5b73f3fd6dba5eb3f3f3fa8b7eed83f3f3f
UTF-8 癲ル슡遊뉒땟儀뤿짎巍ル뜄柳묕┠釉먯뒭娛 111001111001100110110010111000111000001110101011111011001000101010100001111010011000000110001010111010111000100110010010111010111001010110011111111001011000010010000000111010111010010010111111111011001010011110001110111001011011011110001101111000111000001110101011111010111001110010000100111011111010011110001001111010111010110010010101111000101001010010100000111010011000011110001001111010111010100010101111111010111001001010101101111001011010100010011011 e799b2e383abec8aa1e9818aeb8992eb959fe58480eba4bfeca78ee5b78de383abeb9c84efa789ebac95e294a0e98789eba8afeb92ade5a89b
UHC 癲ル슡遊뉒땟儀뤿짎巍ル뜄柳묕┠釉먯뒭娛 1110111110100110101010111110101110011010101011011110101110110100100001111110011110110110101011011110101111110000100011111110101110100011100110101110100011100100101010111110101110001101100010001110101011110111100100011110111110100110101101111110101110111000100100001110110010001010101001101110011111110100 efa6abeb9aadebb487e7b6adebf08feba39ae8e4abeb8d88eaf791efa6b7ebb890ec8aa6e7f4

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)