To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????????????^ 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN 烏l?藥θ。???烏l?藥θ。???^ 1000100101000111100000101000110000111111111001010101101010000011110001101000000101000010001111110011111100111111100010010100011110000010100011000011111111100101010110101000001111000110100000010100001000111111001111110011111101011110 8947828c3fe55a83c681423f3f3f8947828c3fe55a83c681423f3f3f5e
EUC-JP 烏l?藥θ。彛??烏l?藥θ。彛??^ 101100011010100010100011111011000011111111101001101110111010011011001000101000011010001110001111101111001111101000111111001111111011000110101000101000111110110000111111111010011011101110100110110010001010000110100011100011111011110011111010001111110011111101011110 b1a8a3ec3fe9bba6c8a1a38fbcfa3f3fb1a8a3ec3fe9bba6c8a1a38fbcfa3f3f5e
UTF-8 烏l츦藥θ。彛밴맏烏l츦藥θ。彛밴맏^ 1110011110000011100011111110111110111101100011001110110010111000101001101110100010010111101001011100111010111000111000111000000010000010111001011011110110011011111010111011000010110100111010111010011110001111111001111000001110001111111011111011110110001100111011001011100010100110111010001001011110100101110011101011100011100011100000001000001011100101101111011001101111101011101100001011010011101011101001111000111101011110 e7838fefbd8cecb8a6e897a5ceb8e38082e5bd9bebb0b4eba78fe7838fefbd8cecb8a6e897a5ceb8e38082e5bd9bebb0b4eba78f5e
UHC 烏l츦藥θ。彛밴맏烏l츦藥θ。彛밴맏^ 11101000101000011010001111101100101011101001110011100101101101111010010111101000101000011010001111101100101011011011100111101010101110001011101011101000101000011010001111101100101011101001110011100101101101111010010111101000101000011010001111101100101011011011100111101010101110001011101001011110 e8a1a3ecae9ce5b7a5e8a1a3ecadb9eab8bae8a1a3ecae9ce5b7a5e8a1a3ecadb9eab8ba5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)