To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ???????? 0011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f
SJIS-WIN ??斐????占 00111111001111111001010011100011001111110011111100111111001111111001000011101000 3f3f94e33f3f3f3f90e8
EUC-JP ??斐????占 00111111001111111100100011100101001111110011111100111111001111111100000011101010 3f3fc8e53f3f3f3fc0ea
UTF-8 렳뤃斐혦훿핊짜占 111010111010000010110011111010111010010010000011111001101001011010010000111011011001100010100110111011011001101110111111111011011001010110001010111011001010011110011100111001011000110110100000 eba0b3eba483e69690ed98a6ed9bbfed958aeca79ce58da0
UHC 렳뤃斐혦훿핊짜占 10001110110000001000111110110100110111011110110011000010100011101100010010001110110000001000111111000010101001011110111110111111 8ec08fb4ddecc28ec48ec08fc2a5efbf

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)