To which bit string is a character (or string of characters) encoded in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ????????????????????? 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 癲?????酉??奄???ュ?巡??沃?? 111000011001111100111111001111110011111100111111001111111001001111010001001111110011111110001001100000100011111100111111001111111000001110000101001111111000111110000100001111110011111110010111100000000011111100111111 e19f3f3f3f3f3f93d13f3f89823f3f3f83853f8f843f3f97803f3f
EUC-JP 癲?????酉??奄???ュ?巡??沃?? 111000101010000100111111001111110011111100111111001111111100011011010011001111110011111110110001111000100011111100111111001111111010010111100101001111111011110111100100001111110011111111001101111000000011111100111111 e2a13f3f3f3f3fc6d33f3fb1e23f3f3fa5e53fbde43f3fcde03f3f
UTF-8 癲용씭留㏛틠酉몄뿉奄멸랩溜ュ슖巡볥쿋沃쇳똼 111001111001100110110010111011001001101010101001111011001001010010101101111011111010011110001101111000111000111110011011111011011000101110100000111010011000010110001001111010111010101010000100111010111011111110001001111001011010010110000100111010111010100110111000111010111001111010101001111011111010011110001011111000111000001110100101111011001000101010010110111001011011011110100001111010111011001110100101111011001011111110001011111001101011001010000011111011001000011110110011111010111001100010111100 e799b2ec9aa9ec94adefa78de38f9bed8ba0e98589ebaa84ebbf89e5a584eba9b8eb9ea9efa78be383a5ec8a96e5b7a1ebb3a5ecbf8be6b283ec87b3eb98bc
UHC 癲용씭留㏛틠酉몄뿉奄멸랩溜ュ슖巡볥쿋沃쇳똼 111011111010011010111111111010111001110110111110111010111010011110100111111001001011101010001100111010111011011110111000111011001001011110010000111001011111001010111000111010101011011110100110111010101111111010101011111001011001101010100101111000101101111010010011111010111011001010100000111010001010101010111100111011011000110010000010 efa6bfeb9dbeeba7a7e4ba8cebb7b8ec9790e5f2b8eab7a6eafeabe59aa5e2de93ebb2a0e8aabced8c82

SJIS-Win, EUC-JP: Legacy charsets mainly used for Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
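
The conversion shown in the table can be approximated offline. The following is a minimal sketch, not the code behind this page: the function name `encode_bits` and the mapping from the table's charset labels to Python codec names are assumptions, and Python's `cp932` and `cp949` codecs may differ from this page's SJIS-WIN and UHC in edge cases. Characters that a charset cannot represent are replaced with `?` (byte 0x3f), which is consistent with the runs of `3f` in the table.

```python
# Charset labels from the table mapped to Python codec names.
# This mapping is an assumption made for illustration.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_bits(text, codec):
    """Encode text with codec; return (binary string, hex string).

    Unencodable characters become '?' (0x3f) instead of raising an error.
    """
    data = text.encode(codec, errors="replace")
    # Render every byte as exactly eight binary digits.
    binary = "".join(f"{byte:08b}" for byte in data)
    return binary, data.hex()


if __name__ == "__main__":
    sample = "あ"  # any single character or short string
    for label, codec in CHARSETS.items():
        binary, hexadecimal = encode_bits(sample, codec)
        print(label, sample, binary, hexadecimal)
```

Each output line lists the charset, the input, the binary bit string, and the hexadecimal bit string, in the same column order as the table.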