To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f
SJIS-WIN ?レ?違??怨??? 00111111100000111000110000111111100010001110000100111111001111111000100110000101001111110011111100111111 3f838c3f88e13f3f89853f3f3f
EUC-JP ?レ?違??怨??? 00111111101001011110110000111111101100001110001100111111001111111011000111100101001111110011111100111111 3fa5ec3fb0e33f3fb1e53f3f3f
UTF-8 曆レ눘違띷텈怨멸쉽料 111011111010011010001011111000111000001110101100111010111000100010011000111010011000000110010101111010111001110110110111111011011000010110001000111001101000000010101000111010111010100110111000111011001000100110111101111011111010011010111110 efa68be383aceb8898e98195eb9db7ed8588e680a8eba9b8ec89bdefa6be
UHC 曆レ눘違띷텈怨멸쉽料 1110011010110111101010111110110010000111101100011110101011011110100011011110011010110110100001011110101010110011101110001110101010111101101100011110100011110111 e6b7abec87b1eade8de6b685eab3b8eabdb1e8f7

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)