To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????????????????B 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101000010 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f42
SJIS-WIN ??????釗????????意??飮?B 001111110011111100111111001111110011111100111111111110111011101100111111001111110011111100111111001111110011111100111111001111111000100011010011001111110011111110011111010110100011111101000010 3f3f3f3f3f3ffbbb3f3f3f3f3f3f3f3f88d33f3f9f5a3f42
EUC-JP ??????釗?????薏??意??飮?B 001111110011111100111111001111110011111100111111100011111110001110100110001111110011111100111111001111110011111110001111110110011101111000111111001111111011000011010101001111110011111111011101101110110011111101000010 3f3f3f3f3f3f8fe3a63f3f3f3f3f8fd9de3f3fb0d53f3fddbb3f42
UTF-8 溜삘뵗溜띾졎釗숇젦溜삠뀛薏뽧뀛意썬뀛飮큮B 11101111101001111000101111101100100000101001100011101011101101011001011111101111101001111000101111101011100111011011111011101100101000011000111011101001100001111001011111101100100010001000011111101100101000001010011011101111101001111000101111101100100000101010000011101011100000001001101111101000100101101000111111101011101111011010011111101011100000001001101111100110100001001000111111101100100011011010110011101011100000001001101111101001101000111010111011101101100000011010111001000010 efa78bec8298ebb597efa78beb9dbeeca18ee98797ec8887eca0a6efa78bec82a0eb809be8968febbda7eb809be6848fec8daceb809be9a3aeed81ae42
UHC 溜삘뵗溜띾졎釗숇젦溜삠뀛薏뽧뀛意썬뀛飮큮B 1110101011111110101110111110001010010100100110011110101011111110100011011110101110100000101110111110000111110010100110011110101110100000100111101110101011111110101110111110001110000101100101001110101111111011100101101110001110000101100101001110101111110010101111011110001110000101100101001110101111100110101101000111100101000010 eafebbe29499eafe8deba0bbe1f299eba09eeafebbe38594ebfb96e38594ebf2bde38594ebe6b47942

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)