What bit string is a character (or a short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ???[}???[{^ 0011111100111111001111110101101101111101001111110011111100111111010110110111101101011110 3f3f3f5b7d3f3f3f5b7b5e
SJIS-WIN 汚??[}汚??[{^ 10001001100110000011111100111111010110110111110110001001100110000011111100111111010110110111101101011110 89983f3f5b7d89983f3f5b7b5e
EUC-JP 汚??[}汚??[{^ 10110001111110000011111100111111010110110111110110110001111110000011111100111111010110110111101101011110 b1f83f3f5b7db1f83f3f5b7b5e
UTF-8 汚띌쪒[}汚띌쪒[{^ 1110011010110001100110101110101110011101100011001110110010101010100100100101101101111101111001101011000110011010111010111001110110001100111011001010101010010010010110110111101101011110 e6b19aeb9d8cecaa925b7de6b19aeb9d8cecaa925b7b5e
UHC 汚띌쪒[}汚띌쪒[{^ 1110011111111101101101101110100110100101100011000101101101111101111001111111110110110110111010011010010110001100010110110111101101011110 e7fdb6e9a58c5b7de7fdb6e9a58c5b7b5e

SJIS-Win, EUC-JP: Legacy charsets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
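
The conversion shown in the table can be approximated with a short Python sketch. This is not the page's actual implementation. The mapping from the page's charset labels to Python codec names (`latin-1`, `cp932`, `euc_jp`, `utf-8`, `cp949`) is an assumption, as are the helper names `encode_row` and `convert`. Characters that a charset cannot represent are replaced with `?` (0x3f), which appears to match the `3f` bytes in the table.

```python
# Assumed mapping from the page's charset labels to Python codec names.
CODECS = {
    "ISO-8859-1": "latin-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Return (displayed text, binary bit string, hex bit string) for one charset."""
    # Unencodable characters become '?' instead of raising an error.
    raw = text.encode(codec, errors="replace")
    # Decode again to show what survived the round trip.
    shown = raw.decode(codec)
    # Each byte is rendered as 8 bits, most significant bit first.
    binary = "".join(f"{byte:08b}" for byte in raw)
    return shown, binary, raw.hex()


def convert(text):
    """Encode `text` in every charset listed in CODECS."""
    return {label: encode_row(text, codec) for label, codec in CODECS.items()}


if __name__ == "__main__":
    for label, (shown, binary, hexa) in convert("汚").items():
        print(label, shown, binary, hexa)
```

Calling `convert` with the same input string as the table should give comparable rows, although codec tables can differ slightly between implementations for vendor-specific characters.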