What bit string is a character (or a short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset | Character | Bit string (binary) | Bit string (hexadecimal)
ISO-8859-1 | ??????? | 00111111001111110011111100111111001111110011111100111111 | 3f3f3f3f3f3f3f
SJIS-WIN | 日???姙?臟 | 10010011111110100011111100111111001111111001101101001011001111111110010001100110 | 93fa3f3f3f9b4b3fe466
EUC-JP | 日?玎?姙?臟 | 110001101111110000111111100011111100101111010010001111111101010110101100001111111110011111000111 | c6fc3f8fcbd23fd5ac3fe7c7
UTF-8 | 日렮玎렕姙렔臟 | 111001101001011110100101111010111010000010101110111001111000111010001110111010111010000010010101111001011010011110011001111010111010000010010100111010001000011110011111 | e697a5eba0aee78e8eeba095e5a799eba094e8879f
UHC | 日렮玎렕姙렔臟 | 1110110011101101100011101011101111101111111010011000111010101010111011001111010110001110101010011110110111110100 | eced8ebbefe98eaaecf58ea9edf4

SJIS-Win, EUC-JP: Legacy character sets mainly used to encode Japanese text, on Windows (SJIS-Win = CP932) and on UNIX (EUC-JP).
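
A conversion like the one in the table can be approximated in a few lines of Python. This is only a sketch, not the code behind this page. The codec names in `CODECS` are assumptions about which Python codecs correspond to each charset label (for example `cp932` for SJIS-WIN and `cp949` for UHC), and `encode_report` is a hypothetical helper name. Characters that a charset cannot represent are replaced with `?` (0x3f), as in the table, but individual Python codecs may cover slightly different character repertoires than the converter used here, so some rows may differ.

```python
# Assumed mapping from the charset labels above to Python codec names.
CODECS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_report(text, codec):
    """Encode text with the given codec and return (binary string, hex string).

    Unencodable characters are replaced with '?' (byte 0x3f).
    """
    data = text.encode(codec, errors="replace")
    binary = "".join(f"{byte:08b}" for byte in data)  # 8 bits per byte
    return binary, data.hex()


# Print one row per charset for a single character.
for label, codec in CODECS.items():
    binary, hexadecimal = encode_report("日", codec)
    print(label, binary, hexadecimal)
```

Each byte is formatted as exactly eight binary digits, so the binary column is always four times as long as the hexadecimal column.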