What bit string is a character (or a short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ??????? 00111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f
SJIS-WIN 閹馴雫竕ヲ逞尿 11101000100001101001001111101001100011101011010011100010100010011010011011100111100101111001010001000001 e88693e98eb4e289a6e7979441
EUC-JP 閹馴雫竕ヲ逞尿 1110111111100110110001101110101110111100101101101110001111101001100011101010011011101101111101111100011110100010 efe6c6ebbcb6e3e98ea6edf7c7a2
UTF-8 閹馴雫竕ヲ逞尿 111010011001011010111001111010011010011010110100111010011001101110101011111001111010101110010101111011111011110110100110111010011000000010011110111001011011000010111111 e996b9e9a6b4e99babe7ab95efbda6e9809ee5b0bf
UHC ?馴???逞尿 00111111111000101111100000111111001111110011111111010110110000011101001011100011 3fe2f83f3f3fd6c1d2e3

SJIS-Win, EUC-JP: Classic character sets used mainly to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
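
The same kind of conversion can be sketched in Python. This is only an illustration, not the code behind this page: the function name to_bit_strings and the mapping of the charset labels to Python codec names (cp932 for SJIS-WIN, cp949 for UHC) are assumptions, as is the choice to replace unencodable characters with "?" (byte 3f). Results may differ from the table for characters that the two implementations map differently.

```python
# Assumed mapping from the labels used in the table to Python codec names.
CODECS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def to_bit_strings(text):
    """Return {charset label: (binary string, hex string)} for text."""
    rows = {}
    for label, codec in CODECS.items():
        # errors="replace" substitutes "?" for characters the charset lacks.
        raw = text.encode(codec, errors="replace")
        binary = "".join(f"{byte:08b}" for byte in raw)
        rows[label] = (binary, raw.hex())
    return rows


if __name__ == "__main__":
    for label, (binary, hexa) in to_bit_strings("あ").items():
        print(label, binary, hexa)
```

Each byte is printed as eight binary digits, so the binary column is always four times as long as the hexadecimal column.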