To what bit string is a character (or a short string) encoded in each character set?

Enter one character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ??????? 00111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f
SJIS-WIN 臧??率??箋 11100100011010000011111100111111100101111010011000111111001111111110001010110011 e4683f3f97a63f3fe2b3
EUC-JP 臧??率??箋 11100111110010010011111100111111110011101010100000111111001111111110010010110101 e7c93f3fcea83f3fe4b5
UTF-8 臧됵풙率멱킊箋 111010001000011110100111111010111001000010110101111011011001001010011001111001111000111010000111111010111010100110110001111011011000001010001010111001111010111010001011 e887a7eb90b5ed9299e78e87eba9b1ed828ae7ae8b
UHC 臧됵풙率멱킊箋 1110110111110101100010011110111110111110100111001110000111100011101110001110100010110100100101101110111110101000 edf589efbe9ce1e3b8e8b496efa8

SJIS-Win, EUC-JP: Legacy character sets mainly used to encode Japanese text, on Windows (SJIS-Win = CP932) and on UNIX (EUC-JP) respectively.
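
The binary and hexadecimal columns above can be reproduced offline with a short script. The following is a minimal sketch in Python, not the code behind this page: the helper name `encode_row` and the mapping from the page's charset labels to Python codec names (`cp932` for SJIS-WIN, `cp949` for UHC) are assumptions made for illustration. Characters that a charset cannot represent are replaced with `?` (0x3f), which is consistent with the `3f` bytes shown in the table.

```python
# Assumed mapping from the page's charset labels to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Return (binary string, hex string) for text encoded with codec.

    Unencodable characters are replaced with '?' (byte 0x3f).
    """
    raw = text.encode(codec, errors="replace")
    binary = "".join(f"{byte:08b}" for byte in raw)  # 8 bits per byte
    return binary, raw.hex()


if __name__ == "__main__":
    sample = "臧"  # first character of the UTF-8 row above
    for label, codec in CHARSETS.items():
        binary, hexadecimal = encode_row(sample, codec)
        print(label, binary, hexadecimal)
```

Python's codecs may differ from this page's converter for some vendor-specific characters, so treat the output as an approximation of the table rather than an exact replica.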