Which bit string is a character (or a sequence of characters) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ???????? 0011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f
SJIS-WIN 鵝??肄??臾? 1110101001000000001111110011111111100011111001010011111100111111111001000110101100111111 ea403f3fe3e53f3fe46b3f
EUC-JP 鵝??肄??臾? 1111001110100001001111110011111111100110111001110011111100111111111001111100110000111111 f3a13f3fe6e73f3fe7cc3f
UTF-8 鵝숈뮆肄덆렟臾딅 111010011011010110011101111011001000100010001000111010111010111010000110111010001000001010000100111010111000110110000110111010111010000010011111111010001000011110111110111010111001010010000101 e9b59dec8888ebae86e88284eb8d86eba09fe887beeb9485
UHC 鵝숈뮆肄덆렟臾딅 11100100101111011001100111101100100100101001010111101100101111011000100011101001100011101011000011101011101011001000101011101011 e4bd99ec9295ecbd88e98eb0ebac8aeb

SJIS-WIN, EUC-JP: Legacy character sets used mainly to encode Japanese text, on Windows (SJIS-WIN = CP932) and on UNIX (EUC-JP).
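
The conversion shown in the table can be approximated offline. The following is a minimal Python sketch, not the code behind this page. It assumes that the charset labels map to the Python codecs listed in CODECS (for example SJIS-WIN to cp932 and UHC to cp949), and that characters a charset cannot represent are replaced by "?" (0x3f), which is what the table appears to show. The names CODECS, encode_row, and encode_table are hypothetical.

```python
# Assumed mapping from the page's charset labels to Python codec names.
CODECS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Return (binary string, hex string) for text encoded with codec."""
    # errors="replace" substitutes "?" (byte 0x3f) for every character
    # the target charset cannot represent.
    data = text.encode(codec, errors="replace")
    bits = "".join(format(byte, "08b") for byte in data)
    return bits, data.hex()


def encode_table(text):
    """Encode text in every charset of CODECS, keyed by charset label."""
    return {label: encode_row(text, codec) for label, codec in CODECS.items()}


if __name__ == "__main__":
    for label, (bits, hexed) in encode_table("あ").items():
        print(label, bits, hexed)
```

Each byte is printed as exactly eight bits, so the binary column is always four times as long as the hexadecimal column.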