Which bit string is a character (or short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ??????????????????^ 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN 霎滉ケ冷皈邵コ襞晁セ滉ケ励Η邵コ譁ス^ 11101000101111101001111111100100101110011001011111100010111000011010010111100111101110001011101011100101111111001001110111101000101111101001111111100100101110011001011111100011100000111010010111100111101110001011101011100110100101101011110101011110 e8be9fe4b997e2e1a5e7b8bae5fc9de8be9fe4b997e383a5e7b8bae696bd5e
EUC-JP 霎滉ケ冷皈邵コ襞晁セ滉ケ励Η邵コ譁ス^ 11110000110000001101111011100110100011101011100111001110111001001110001010100111111011101011101010001110101110101110101011111110110110101110101010001110101111101101111011100110100011101011100111001110111001011010011010100111111011101011101010001110101110101110101111110110100011101011110101011110 f0c0dee68eb9cee4e2a7eeba8ebaeafedaea8ebedee68eb9cee5a6a7eeba8ebaebf68ebd5e
UTF-8 霎滉ケ冷皈邵コ襞晁セ滉ケ励Η邵コ譁ス^ 111010011001110010001110111001101011101110001001111011111011110110111001111001011000011010110111111001111001101010001000111010011000001010110101111011111011110110111010111010001010010110011110111001101001100110000001111011111011110110111110111001101011101110001001111011111011110110111001111001011000101010110001110011101001011111101001100000101011010111101111101111011011101011101000101011011000000111101111101111011011110101011110 e99c8ee6bb89efbdb9e586b7e79a88e982b5efbdbae8a59ee69981efbdbee6bb89efbdb9e58ab1ce97e982b5efbdbae8ad81efbdbd5e
UHC ?滉?冷?邵??晁?滉??Η邵?譁?^ 001111111111110011010001001111111101010111010010001111111110000111010000001111110011111111110000110001010011111111111100110100010011111100111111101001011100011111100001110100000011111111111100101001100011111101011110 3ffcd13fd5d23fe1d03f3ff0c53ffcd13f3fa5c7e1d03ffca63f5e

SJIS-Win, EUC-JP: Legacy charsets used mainly for Japanese text, on Windows (SJIS-Win = CP932) and on UNIX (EUC-JP).
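
A conversion like the one in the table can be approximated in Python. This is only a minimal sketch, not the page's actual implementation: the function name `to_bit_strings` and the mapping from the charset labels to Python codec names (for example `cp932` for SJIS-WIN and `cp949` for UHC) are assumptions made for illustration. Characters that a charset cannot represent are replaced with `?` (0x3f), which is consistent with the ISO-8859-1 row above.

```python
# Assumed mapping from the page's charset labels to Python codec names.
CODECS = {
    "ISO-8859-1": "latin-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def to_bit_strings(text, codec):
    """Encode text with the given codec; return (binary string, hex string)."""
    # errors="replace" substitutes "?" (0x3f) for unencodable characters.
    data = text.encode(codec, errors="replace")
    # Each byte becomes eight binary digits, most significant bit first.
    binary = "".join(f"{byte:08b}" for byte in data)
    return binary, data.hex()


if __name__ == "__main__":
    # Print one row per charset, in the same column order as the table.
    for label, codec in CODECS.items():
        binary, hexadecimal = to_bit_strings("あ^", codec)
        print(label, "あ^", binary, hexadecimal)
```

For instance, `to_bit_strings("あ^", "utf-8")` yields the hexadecimal string `e381825e`, while the same input under `latin-1` yields `3f5e` because `あ` has no ISO-8859-1 encoding.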