To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 癲ル?肉ょ?矣???〓?異?┠釉??沃 1110000110011111100000111000101100111111100100111111011110000010111001010011111111100001111000010011111100111111001111111000000110101100001111111000100011011001001111111000010010110101111001111101011000111111001111111001011110000000 e19f838b3f93f782e53fe1e13f3f3f81ac3f88d93f84b5e7d63f3f9780
EUC-JP 癲ル?肉ょ?矣??璵〓?異?┠釉??沃 11100010101000011010010111101011001111111100011011111001101001001110011100111111111000101110001100111111001111111000111111001100111001101010001010101110001111111011000011011011001111111010100010110111111011101101100000111111001111111100110111100000 e2a1a5eb3fc6f9a4e73fe2e33f3f8fcce6a2ae3fb0db3fa8b7eed83f3fcde0
UTF-8 癲ル슢肉ょ뵳矣섏묾璵〓끃異뤄┠釉띾쐠沃 111001111001100110110010111000111000001110101011111011001000101010100010111010001000001010001001111000111000001010000111111010111011010110110011111001111001111110100011111011001000010010001111111010111010110010111110111001111001001010110101111000111000000010010011111010111000000110000011111001111001010110110000111010111010010010000100111000101001010010100000111010011000011110001001111010111001110110111110111011001001000010100000111001101011001010000011 e799b2e383abec8aa2e88289e38287ebb5b3e79fa3ec848febacbee792b5e38093eb8183e795b0eba484e294a0e98789eb9dbeec90a0e6b283
UHC 癲ル슢肉ょ뵳矣섏묾璵〓끃異뤄┠釉띾쐠沃 1110111110100110101010111110101110011010101011101110101110111111101010101110011110010100101100011110101111111000100110001110110010111001101100101110011010100101101000011110101110000101101110011110110010110110101101111110111110100110101101111110101110111000100011011110101110011100100001101110100010101010 efa6abeb9aaeebbfaae794b1ebf898ecb9b2e6a5a1eb85b9ecb6b7efa6b7ebb88deb9c86e8aa

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)