To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????}????{^ 0011111100111111001111110011111101111101001111110011111100111111001111110111101101011110 3f3f3f3f7d3f3f3f3f7b5e
SJIS-WIN 薔?蔚?}薔?蔚?{^ 111001010100101100111111100010010101010100111111011111011110010101001011001111111000100101010101001111110111101101011110 e54b3f89553f7de54b3f89553f7b5e
EUC-JP 薔?蔚?}薔?蔚?{^ 111010011010110000111111101100011011011000111111011111011110100110101100001111111011000110110110001111110111101101011110 e9ac3fb1b63f7de9ac3fb1b63f7b5e
UTF-8 薔렟蔚턴}薔렟蔚턴{^ 111010001001011010010100111010111010000010011111111010001001010010011010111011011000010010110100011111011110100010010110100101001110101110100000100111111110100010010100100110101110110110000100101101000111101101011110 e89694eba09fe8949aed84b47de89694eba09fe8949aed84b47b5e
UHC 薔렟蔚턴}薔렟蔚턴{^ 11101101111110011000111010110000111010101010010111000101110011110111110111101101111110011000111010110000111010101010010111000101110011110111101101011110 edf98eb0eaa5c5cf7dedf98eb0eaa5c5cf7b5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)