To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????E 0011111100111111001111110011111100111111001111110011111100111111001111110011111101000101 3f3f3f3f3f3f3f3f3f3f45
SJIS-WIN ???孫賦???孫鳧E 001111110011111100111111100100011011011110010101100010100011111100111111001111111001000110110111111010011110100001000101 3f3f3f91b7958a3f3f3f91b7e9e845
EUC-JP ???孫賦???孫鳧E 001111110011111100111111110000101011100111001001111010100011111100111111001111111100001010111001111100101110101001000101 3f3f3fc2b9c9ea3f3f3fc2b9f2ea45
UTF-8 뤞첁쇽孫賦뤞첁쇽孫鳧E 11101011101001001001111011101100101100101000000111101100100001111011110111100101101011011010101111101000101100111010011011101011101001001001111011101100101100101000000111101100100001111011110111100101101011011010101111101001101100111010011101000101 eba49eecb281ec87bde5adabe8b3a6eba49eecb281ec87bde5adabe9b3a745
UHC 뤞첁쇽孫賦뤞첁쇽孫鳧E 100011111100110110101010100011101011110011101111111000011101110111011101101101111000111111001101101010101000111010111100111011111110000111011101110111011100000001000101 8fcdaa8ebcefe1ddddb78fcdaa8ebcefe1ddddc045

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)