To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????h???? 001111110011111100111111001111110110100000111111001111110011111100111111 3f3f3f3f683f3f3f3f
SJIS-WIN 薔?臟?h薔?臟? 11100101010010110011111111100100011001100011111101101000111001010100101100111111111001000110011000111111 e54b3fe4663f68e54b3fe4663f
EUC-JP 薔?臟?h薔?臟? 11101001101011000011111111100111110001110011111101101000111010011010110000111111111001111100011100111111 e9ac3fe7c73f68e9ac3fe7c73f
UTF-8 薔렟臟렑h薔렟臟렑 11101000100101101001010011101011101000001001111111101000100001111001111111101011101000001001000101101000111010001001011010010100111010111010000010011111111010001000011110011111111010111010000010010001 e89694eba09fe8879feba09168e89694eba09fe8879feba091
UHC 薔렟臟렑h薔렟臟렑 1110110111111001100011101011000011101101111101001000111010100110011010001110110111111001100011101011000011101101111101001000111010100110 edf98eb0edf48ea668edf98eb0edf48ea6

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)