What bit string is a character (or a short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ???????????????????? 0011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 霤タ釵ュム「ワクシ棈霤タ釵ュム「ワクシ曻 1110100011000011110000001110011111011110101011011101000110100010110111001011100010111100111110100110010011101000110000111100000011100111110111101010110111010001101000101101110010111000101111001111101001100110 e8c3c0e7deadd1a2dcb8bcfa64e8c3c0e7deadd1a2dcb8bcfa66
EUC-JP 霤タ釵ュム「ワクシ棈霤タ釵ュム「ワクシ曻 111100001100010110001110110000001110111011100000100011101010110110001110110100011000111010100010100011101101110010001110101110001000111010111100100011111100001111111100111100001100010110001110110000001110111011100000100011101010110110001110110100011000111010100010100011101101110010001110101110001000111010111100100011111100001010111111 f0c58ec0eee08ead8ed18ea28edc8eb88ebc8fc3fcf0c58ec0eee08ead8ed18ea28edc8eb88ebc8fc2bf
UTF-8 霤タ釵ュム「ワクシ棈霤タ釵ュム「ワクシ曻 111010011001110010100100111011111011111010000000111010011000011110110101111011111011110110101101111011111011111010010001111011111011110110100010111011111011111010011100111011111011110110111000111011111011110110111100111001101010001110001000111010011001110010100100111011111011111010000000111010011000011110110101111011111011110110101101111011111011111010010001111011111011110110100010111011111011111010011100111011111011110110111000111011111011110110111100111001101001101110111011 e99ca4efbe80e987b5efbdadefbe91efbda2efbe9cefbdb8efbdbce6a388e99ca4efbe80e987b5efbdadefbe91efbda2efbe9cefbdb8efbdbce69bbb
UHC ??釵?????????釵??????? 00111111001111111111001111111011001111110011111100111111001111110011111100111111001111110011111100111111111100111111101100111111001111110011111100111111001111110011111100111111 3f3ff3fb3f3f3f3f3f3f3f3f3ff3fb3f3f3f3f3f3f3f

SJIS-Win, EUC-JP: Legacy character sets used mainly to encode Japanese text on Windows (SJIS-Win, i.e. CP932) and UNIX (EUC-JP).
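
The conversion shown in the table can be approximated offline. The following is a minimal sketch in Python, not the code behind this page: the helper names `encode_row` and `convert` are hypothetical, and the mapping from the table's charset labels to Python codec names (for example SJIS-WIN to `cp932`, UHC to `cp949`) is an assumption. Characters that a charset cannot represent are replaced with `?` (hex 3f), which is consistent with the runs of 3f in the ISO-8859-1 and UHC rows, although the replacement policy of the original tool is not documented here.

```python
# Assumed mapping from the table's charset labels to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Return (binary string, hex string) for text encoded with codec.

    Unencodable characters are replaced with '?' (0x3f).
    """
    raw = text.encode(codec, errors="replace")
    bits = "".join(format(byte, "08b") for byte in raw)
    return bits, raw.hex()


def convert(text):
    """Encode text in every charset and return {label: (bits, hex)}."""
    return {label: encode_row(text, codec) for label, codec in CHARSETS.items()}


if __name__ == "__main__":
    # Print one row per charset, in the same column order as the table.
    sample = "あ"
    for label, (bits, hexstr) in convert(sample).items():
        print(label, sample, bits, hexstr)
```

Byte sequences for less common characters may differ from the table, because Python's codecs and the tool's converters do not necessarily use identical mapping tables.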