What bit string is a character (or a short run of characters) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ?????????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 櫻??誼?????揶??異э????節??? 10011111010011100011111100111111100010110110001000111111001111110011111100111111001111111001110110001000001111110011111110001000110110011000010010001111001111110011111100111111001111111001000011011111001111110011111100111111 9f4e3f3f8b623f3f3f3f3f9d883f3f88d9848f3f3f3f3f90df3f3f3f
EUC-JP 櫻??誼??洧??揶??異э????節??? 110111011010111100111111001111111011010111000011001111110011111110001111110001111011010000111111001111111101100111101000001111110011111110110000110110111010011111101111001111110011111100111111001111111100000011100001001111110011111100111111 ddaf3f3fb5c33f3f8fc7b43f3fd9e83f3fb0dba7ef3f3f3f3fc0e13f3f3f
UTF-8 櫻뗰퐢誼숂뛾洧띠몴揶쏆떔異э쭓六롥죰節뗭젟劣 1110011010101011101110111110101110010111101100001110110110010000101000101110100010101010101111001110110010001000100000101110101110011011101111101110011010110100101001111110101110011101101000001110101110101010101101001110011010001111101101101110110010001111100001101110101110010110100101001110011110010101101100001101000110001101111011001010110110010011111011111010011110010001111010111010000110100101111011001010001110110000111001111010111110000000111010111001011110101101111011001010000010011111111011111010011010011101 e6abbbeb97b0ed90a2e8aabcec8882eb9bbee6b4a7eb9da0ebaab4e68fb6ec8f86eb9694e795b0d18decad93efa791eba1a5eca3b0e7af80eb97adeca09fefa69d
UHC 櫻뗰퐢誼숂뛾洧띠몴揶쏆떔異э쭓六롥죰節뗭젟劣 1110010110100001100010111110111110111101100010111110101111111110100110011110011110001101100001001110101011111011101101101110110010010001100111001110010110101010100110111110110010001011101010101110110010110110101011001110111110100111100010111110101110111011100011101110010110100001100010111110111110111101100010111110110010100000100110011110011011101011 e5a18befbd8bebfe99e78d84eafbb6ec919ce5aa9bec8baaecb6acefa78bebbb8ee5a18befbd8beca099e6eb

SJIS-Win, EUC-JP: Legacy charsets used mainly for encoding Japanese text, on Windows (SJIS-Win = CP932) and UNIX (EUC-JP) respectively.
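
The conversion shown in the table can be approximated with a short Python sketch. It is not this page's own implementation, which is not shown here. The mapping from the table's charset labels to Python codec names (in particular SJIS-WIN to cp932 and UHC to cp949) is an assumption, as are the helper names to_bit_strings and convert. Characters that a charset cannot represent are replaced with "?" (byte 3f), which is consistent with the runs of 3f in the ISO-8859-1, SJIS-WIN and EUC-JP rows.

```python
# Assumed mapping from the table's charset labels to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def to_bit_strings(text, codec):
    """Encode text with codec and return (binary string, hex string)."""
    # errors="replace" substitutes "?" (0x3f) for unencodable characters.
    raw = text.encode(codec, errors="replace")
    binary = "".join(f"{byte:08b}" for byte in raw)
    return binary, raw.hex()


def convert(text):
    """Return {charset label: (binary, hex)} for every charset in CHARSETS."""
    return {label: to_bit_strings(text, codec) for label, codec in CHARSETS.items()}


if __name__ == "__main__":
    for label, (binary, hexadecimal) in convert("あ").items():
        print(label, binary, hexadecimal)
```

For example, to_bit_strings("あ", "utf-8") yields the hexadecimal string e38182, while the same character in ISO-8859-1 becomes 3f because that charset has no hiragana.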