To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????? 001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f
SJIS-WIN ???怡??碎?? 0011111100111111001111111001110001111101001111110011111111100001111010100011111100111111 3f3f3f9c7d3f3fe1ea3f3f
EUC-JP ??Ł怡??碎?? 00111111001111111000111110101001101010001101011111011110001111110011111111100010111011000011111100111111 3f3f8fa9a8d7de3f3fe2ec3f3f
UTF-8 蓮용Ł怡득떤碎띔덮 1110111110100110100110011110110010011010101010011100010110000001111001101000000010100001111010111001001110011101111010111001011010100100111001111010001010001110111010111001110110010100111010111000110110101110 efa699ec9aa9c581e680a1eb939deb96a4e7a28eeb9d94eb8dae
UHC 蓮용Ł怡득떤碎띔덮 111001101110010110111111111010111010100010101001111011001010111010110101111001101011011010110010111000011110111110110110111010101011010110100100 e6e5bfeba8a9ecaeb5e6b6b2e1efb6eab5a4

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)