To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 癲ル?泣?ⅱ應?┛汚??異??怨??五 1110000110011111100000111000101100111111100010111000001100111111111110100100000110011100111001000011111110000100101011101000100110011000001111110011111110001000110110010011111100111111100010011000010100111111001111111000110011011100 e19f838b3f8b833ffa419ce43f84ae89983f3f88d93f3f89853f3f8cdc
EUC-JP 癲ル?泣??應?┛汚??異??怨??五 11100010101000011010010111101011001111111011010111100011001111110011111111011000111001100011111110101000101100001011000111111000001111110011111110110000110110110011111100111111101100011110010100111111001111111011100011011110 e2a1a5eb3fb5e33f3fd8e63fa8b0b1f83f3fb0db3f3fb1e53f3fb8de
UTF-8 癲ル슢泣€ⅱ應쇰┛汚살쉸異면댖怨쀬뒠五 111001111001100110110010111000111000001110101011111011001000101010100010111001101011001110100011111000101000001010101100111000101000010110110001111001101000011110001001111011001000011110110000111000101001010010011011111001101011000110011010111011001000001010110100111011001000100110111000111001111001010110110000111010111010100110110100111010111000110010010110111001101000000010101000111011001000000010101100111010111001001010100000111001001011101010010100 e799b2e383abec8aa2e6b3a3e282ace285b1e68789ec87b0e2949be6b19aec82b4ec89b8e795b0eba9b4eb8c96e680a8ec80aceb92a0e4ba94
UHC 癲ル슢泣€ⅱ應쇰┛汚살쉸異면댖怨쀬뒠五 1110111110100110101010111110101110011010101011101110101111101000101000101110011010100101101000101110101111101011101111001110101110100110101100001110011111111101101110111110110010011010100011101110110010110110101110001110100110001000101110101110101010110011100101111110110010001010100111001110011111101001 efa6abeb9aaeebe8a2e6a5a2ebebbceba6b0e7fdbbec9a8eecb6b8e988baeab397ec8a9ce7e9

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)