What bit string is a character (or several characters) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ??????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 癲ル?泣f?鎖??語?????純??億 111000011001111110000011100010110011111110001011100000111000001010000110001111111000110110111101001111110011111110001100111010100011111100111111001111110011111100111111100011111000001100111111001111111000100110101101 e19f838b3f8b8382863f8dbd3f3f8cea3f3f3f3f3f8f833f3f89ad
EUC-JP 癲ル?泣f?鎖??語??瑗??純??億 1110001010100001101001011110101100111111101101011110001110100011111001100011111110111010101111110011111100111111101110001110110000111111001111111000111111001100110000000011111100111111101111011110001100111111001111111011001010101111 e2a1a5eb3fb5e3a3e63fbabf3f3fb8ec3f3f8fccc03f3fbde33f3fb2af
UTF-8 癲ル슢泣f룚鎖듦석語ⓦ뀼瑗삣윜純볩폁億 111001111001100110110010111000111000001110101011111011001000101010100010111001101011001110100011111011111011110110000110111010111010001110011010111010011000111010010110111010111001001110100110111011001000010010011101111010001010101010011110111000101001001110100110111010111000000010111100111001111001000110010111111011001000001010100011111011001001110010011100111001111011010010010100111010111011001110101001111011011000111110000001111001011000010010000100 e799b2e383abec8aa2e6b3a3efbd86eba39ae98e96eb93a6ec849de8aa9ee293a6eb80bce79197ec82a3ec9c9ce7b494ebb3a9ed8f81e58484
UHC 癲ル슢泣f룚鎖듦석語ⓦ뀼瑗삣윜純볩폁億 1110111110100110101010111110101110011010101011101110101111101000101000111110011010001111100101101110000111110000101101011110101010111100101011101110010111011110101010001110001110000101101100101110101010111100101110111110010110011111100111111110001011101101100100111110111110111100100100001110010111100010 efa6abeb9aaeebe8a3e68f96e1f0b5eabcaee5dea8e385b2eabcbbe59f9fe2ed93efbc90e5e2

SJIS-Win, EUC-JP: Classic character sets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
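
The conversion shown in the table can be approximated offline. The following is a minimal Python sketch, not the code behind this page. The codec names in `CODECS` are assumptions about which Python codecs correspond to the charsets listed above (in particular `cp932` for SJIS-WIN and `cp949` for UHC), and `encode_row` is a hypothetical helper name. Characters that a charset cannot represent are replaced with `?` (hex `3f`), which is consistent with the runs of `3f` in the ISO-8859-1 row.

```python
# Assumed mapping from the charset labels in the table to Python codec names.
CODECS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Return (binary bit string, hexadecimal string) for text in the given codec."""
    # errors="replace" substitutes b"?" for characters the codec cannot encode.
    raw = text.encode(codec, errors="replace")
    bits = "".join(f"{byte:08b}" for byte in raw)  # 8 bits per byte, zero-padded
    return bits, raw.hex()


if __name__ == "__main__":
    for name, codec in CODECS.items():
        bits, hex_string = encode_row("語", codec)
        print(name, bits, hex_string)
```

For example, `encode_row("語", "utf-8")` returns the hex string `e8aa9e`, and the same character in ISO-8859-1 falls back to `3f` because that charset has no code point for it.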