To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????^ 00111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f5e
SJIS-WIN 腆律鄙腆律鄙^ 11100100010000011001011110100101111001111011111111100100010000011001011110100101111001111011111101011110 e44197a5e7bfe44197a5e7bf5e
EUC-JP 腆律鄙腆律鄙^ 11100111101000101100111010100111111011101100000111100111101000101100111010100111111011101100000101011110 e7a2cea7eec1e7a2cea7eec15e
UTF-8 腆律鄙腆律鄙^ 11101000100001011000011011100101101111101000101111101001100001001001100111101000100001011000011011100101101111101000101111101001100001001001100101011110 e88586e5be8be98499e88586e5be8be984995e
UHC ?律鄙?律鄙^ 0011111111010111110010001101111010101001001111111101011111001000110111101010100101011110 3fd7c8dea93fd7c8dea95e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)