Which bit string is a character (or string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ????????? 001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f
SJIS-WIN ???踰?.宥?? 001111110011111100111111111001101111101000111111100000010100010010010111010001110011111100111111 3f3f3fe6fa3f814497473f3f
EUC-JP ???踰?.宥?? 001111110011111100111111111011001111110000111111101000011010010111001101101010000011111100111111 3f3f3fecfc3fa1a5cda83f3f
UTF-8 略녠쑬踰앶.宥룸윹 111011111010010110110110111010111000010110100000111011001001000110101100111010001011100010110000111011001001010110110110111011111011110010001110111001011010111010100101111010111010001110111000111011001001110010111001 efa5b6eb85a0ec91ace8b8b0ec95b6efbc8ee5aea5eba3b8ec9cb9
UHC 略녠쑬踰앶.宥룸윹 111001011011001010110011111010101011111010101000111010111011001010011101111010011010001110101110111010101110100110110111111010111001111110110011 e5b2b3eabea8ebb29de9a3aeeae9b7eb9fb3

SJIS-WIN, EUC-JP: Legacy character sets mainly used to encode Japanese text on Windows (SJIS-WIN = CP932) and UNIX (EUC-JP). A character that a charset cannot represent is shown as "?" (byte 3f).
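
The conversion shown in the table can be approximated offline. The following is a minimal Python sketch, not the tool's actual implementation. The function names `encode_to_bits` and `convert` are hypothetical, and the mapping from the table's charset labels to Python codec names (for example SJIS-WIN to `cp932` and UHC to `cp949`) is an assumption. Unrepresentable characters are replaced with "?", which matches the 3f bytes in the table.

```python
# Assumed mapping from the table's charset labels to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_to_bits(text, codec):
    """Return (binary string, hex string) for text encoded with codec.

    Characters the codec cannot represent are replaced with '?'.
    """
    raw = text.encode(codec, errors="replace")
    binary = "".join(f"{byte:08b}" for byte in raw)
    return binary, raw.hex()


def convert(text):
    """Encode text in every charset listed in CHARSETS."""
    return {name: encode_to_bits(text, codec) for name, codec in CHARSETS.items()}


if __name__ == "__main__":
    # Print one row per charset for a single sample character.
    for name, (binary, hexadecimal) in convert("宥").items():
        print(name, binary, hexadecimal)
```

Each byte is rendered as eight binary digits, so the binary column is always four times as long as the hexadecimal column.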