To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f
SJIS-WIN ??????肉??猥 001111110011111100111111001111110011111100111111100100111111011100111111001111111110000011001110 3f3f3f3f3f3f93f73f3fe0ce
EUC-JP 薏?????肉??猥 1000111111011001110111100011111100111111001111110011111100111111110001101111100100111111001111111110000011010000 8fd9de3f3f3f3f3fc6f93f3fe0d0
UTF-8 薏꾤죪琉앸뒓肉끹늾猥 111010001001011010001111111010101011111010100100111011001010001110101010111011111010011110001100111011001001010110111000111010111001001010010011111010001000001010001001111010111000000110111001111010111000101010111110111001111000110010100101 e8968feabea4eca3aaefa78cec95b8eb9293e88289eb81b9eb8abee78ca5
UHC 薏꾤죪琉앸뒓肉끹늾猥 1110101111111011100001001110011110100001100001011110101110100100100111011110101110001010100100001110101110111111100001011110001110001000100001111110100011100101 ebfb84e7a185eba49deb8a90ebbf85e38887e8e5

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)