To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 櫻??油ε?循??也 100111110100111000111111001111111001011011111011100000111100001100111111100011110111101000111111001111111001011011100111 9f4e3f3f96fb83c33f8f7a3f3f96e7
EUC-JP 櫻??油ε?循??也 110111011010111100111111001111111100110011111101101001101100010100111111101111011101101100111111001111111100110011101001 ddaf3f3fccfda6c53fbddb3f3fcce9
UTF-8 櫻뗫렇油ε궟循놃렦也 1110011010101011101110111110101110010111101010111110101110100000100001111110011010110010101110011100111010110101111010101011011010011111111001011011111010101010111010111000011010000011111010111010000010100110111001001011100110011111 e6abbbeb97abeba087e6b2b9ceb5eab69fe5beaaeb8683eba0a6e4b99f
UHC 櫻뗫렇油ε궟循놃렦也 1110010110100001100010111110101110110111101110001110101011111010101001011110010110000010101100101110001011100000100001101110110110001110101101011110010110100101 e5a18bebb7b8eafaa5e582b2e2e086ed8eb5e5a5

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)