To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????????®???????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111101011100011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3fae3f3f3f3f3f3f3f3f
SJIS-WIN 午??踰??臾??壓??溢??幽??耶??艤 100011001101111100111111001111111110011011111010001111110011111111100100011010110011111100111111100110101101100000111111001111111000100011101100001111110011111110010111010010000011111100111111100101101110101100111111001111111110010001111110 8cdf3f3fe6fa3f3fe46b3f3f9ad83f3f88ec3f3f97483f3f96eb3f3fe47e
EUC-JP 午??踰??臾??壓??溢®?幽??耶??艤 1011100011100001001111110011111111101100111111000011111100111111111001111100110000111111001111111101010011011010001111110011111110110000111011101000111110100010111011100011111111001101101010010011111100111111110011001110110100111111001111111110011111011111 b8e13f3fecfc3f3fe7cc3f3fd4da3f3fb0ee8fa2ee3fcda93f3fcced3f3fe7df
UTF-8 午닿퓥踰곻쭛臾딄퉿壓믪궛溢®솾幽덈뼠耶븐슌艤 1110010110001101100010001110101110001011101111111110110110010011101001011110100010111000101100001110101010110011101110111110110010101101100110111110100010000111101111101110101110010100100001001110110110001001101111111110010110100011100100111110101110101111101010101110101010110110100110111110011010111010101000101100001010101110111011001000011010111110111001011011100110111101111010111000110110001000111010111011110010100000111010001000000010110110111010111011100010010000111011001000101010001100111010001000100110100100 e58d88eb8bbfed93a5e8b8b0eab3bbecad9be887beeb9484ed89bfe5a393ebafaaeab69be6baa2c2aeec86bee5b9bdeb8d88ebbca0e880b6ebb890ec8a8ce889a4
UHC 午닿퓥踰곻쭛臾딄퉿壓믪궛溢®솾幽덈뼠耶븐슌艤 1110011111101101101101001110101010111111100011101110101110110010100000011110111110100111100100011110101110101100100010101110101010111001100101111110010011100010100100101110110010000010101100001110110011101110101000101110011110011001101100101110101011101011100010001110101110010110101000111110010110101101101110101110110010011010100111001110101111111010 e7edb4eabf8eebb281efa791ebac8aeab997e4e292ec82b0eceea2e799b2eaeb88eb96a3e5adbaec9a9cebfa

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)