What bit string is a character (or string of characters) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ????????E 001111110011111100111111001111110011111100111111001111110011111101000101 3f3f3f3f3f3f3f3f45
SJIS-WIN ?眩???眩??E 0011111111100001101111110011111100111111001111111110000110111111001111110011111101000101 3fe1bf3f3f3fe1bf3f3f45
EUC-JP ?眩???眩?堧E 00111111111000101100000100111111001111110011111111100010110000010011111110001111101110001010100001000101 3fe2c13f3f3fe2c13f8fb8a845
UTF-8 뤺眩㎱曆뤺眩㎱堧E 11101011101001001011101011100111100111001010100111100011100011101011000111101111101001101000101111101011101001001011101011100111100111001010100111100011100011101011000111100101101000001010011101000101 eba4bae79ca9e38eb1efa68beba4bae79ca9e38eb1e5a0a745
UHC 뤺眩㎱曆뤺眩㎱堧E 1000111111101000111110101101111110100111110000001110011010110111100011111110100011111010110111111010011111000000111001101100000001000101 8fe8fadfa7c0e6b78fe8fadfa7c0e6c045

SJIS-Win, EUC-JP: Legacy character sets mainly used to encode Japanese text, on Windows (SJIS-Win = CP932) and on UNIX (EUC-JP).
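
The conversion shown in the table can be approximated with a short Python sketch. It is not this page's implementation. The codec names in CHARSETS are Python's closest equivalents and are an assumption (cp932 for SJIS-Win, cp949 for UHC), and the helper names to_bit_strings and convert are made up for illustration. Characters that a charset cannot represent are replaced with "?" (byte 3f), which resembles the 3f bytes in the table, though the page's exact substitution rules may differ.

```python
# Assumed mapping from the charset labels in the table to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-Win": "cp932",   # assumption: CP932 stands in for SJIS-Win
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",        # assumption: CP949 stands in for UHC
}


def to_bit_strings(text, codec):
    """Encode text with the given codec; return (binary, hexadecimal) strings."""
    # errors="replace" substitutes "?" for characters the codec cannot encode.
    data = text.encode(codec, errors="replace")
    binary = "".join(f"{byte:08b}" for byte in data)  # 8 bits per byte
    return binary, data.hex()


def convert(text):
    """Return {charset label: (binary, hexadecimal)} for every charset above."""
    return {label: to_bit_strings(text, codec) for label, codec in CHARSETS.items()}


# Print one row per charset, in the same column order as the table.
for label, (binary, hexadecimal) in convert("E").items():
    print(label, "E", binary, hexadecimal)
```

For the single character "E", every charset listed produces the same byte, 45, because all of them are ASCII-compatible for that character. Differences only appear for characters outside ASCII.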