To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????????????^ 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN 嚥??偃??猥れ?嚥??偃??猥れ?^ 100110101000101100111111001111111001100011101110001111110011111111100000110011101000001011101010001111111001101010001011001111110011111110011000111011100011111100111111111000001100111010000010111010100011111101011110 9a8b3f3f98ee3f3fe0ce82ea3f9a8b3f3f98ee3f3fe0ce82ea3f5e
EUC-JP 嚥??偃??猥れ?嚥??偃??猥れ?^ 110100111110101100111111001111111101000011110000001111110011111111100000110100001010010011101100001111111101001111101011001111110011111111010000111100000011111100111111111000001101000010100100111011000011111101011110 d3eb3f3fd0f03f3fe0d0a4ec3fd3eb3f3fd0f03f3fe0d0a4ec3f5e
UTF-8 嚥잙젇偃띾젽猥れ뙣嚥잙젇偃띾젽猥れ뙟^ 11100101100110101010010111101100100111101001100111101100101000001000011111100101100000011000001111101011100111011011111011101100101000001011110111100111100011001010010111100011100000101000110011101011100110011010001111100101100110101010010111101100100111101001100111101100101000001000011111100101100000011000001111101011100111011011111011101100101000001011110111100111100011001010010111100011100000101000110011101011100110011001111101011110 e59aa5ec9e99eca087e58183eb9dbeeca0bde78ca5e3828ceb99a3e59aa5ec9e99eca087e58183eb9dbeeca0bde78ca5e3828ceb999f5e
UHC 嚥잙젇偃띾젽猥れ뙣嚥잙젇偃띾젽猥れ뙟^ 11100110101111111001111111101011101000001000101011100101111001111000110111101011101000001010111111101000111001011010101011101100100011001010100011100110101111111001111111101011101000001000101011100101111001111000110111101011101000001010111111101000111001011010101011101100100011001010010001011110 e6bf9feba08ae5e78deba0afe8e5aaec8ca8e6bf9feba08ae5e78deba0afe8e5aaec8ca45e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)