To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f
SJIS-WIN ???泣??瑜??円 00111111001111110011111110001011100000110011111100111111111000001110111100111111001111111000100101111110 3f3f3f8b833f3fe0ef3f3f897e
EUC-JP ???泣??瑜??円 00111111001111110011111110110101111000110011111100111111111000001111000100111111001111111011000111011111 3f3f3fb5e33f3fe0f13f3fb1df
UTF-8 蓮용㉡泣됧죰瑜낆뒌円 111011111010011010011001111011001001101010101001111000111000100110100001111001101011001110100011111010111001000010100111111011001010001110110000111001111001000110011100111010111000001010000110111010111001001010001100111001011000011010000110 efa699ec9aa9e389a1e6b3a3eb90a7eca3b0e7919ceb8286eb928ce58686
UHC 蓮용㉡泣됧죰瑜낆뒌円 1110011011100101101111111110101110101000101100101110101111101000100010011110010110100001100010111110101110100101100001011110110010001010100010011110010111110111 e6e5bfeba8b2ebe889e5a18beba585ec8a89e5f7

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)