To what bit string is a character (or a short string) encoded in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ?????????[?????????[^ 001111110011111100111111001111110011111100111111001111110011111100111111010110110011111100111111001111110011111100111111001111110011111100111111001111110101101101011110 3f3f3f3f3f3f3f3f3f5b3f3f3f3f3f3f3f3f3f5b5e
SJIS-WIN 鉛?????悅?Щ[鉛?????悅?Щ[^ 100010011001010000111111001111110011111100111111001111111111101010111101001111111000010001011010010110111000100110010100001111110011111100111111001111110011111111111010101111010011111110000100010110100101101101011110 89943f3f3f3f3ffabd3f845a5b89943f3f3f3f3ffabd3f845a5b5e
EUC-JP 鉛???????Щ[鉛???????Щ[^ 10110001111101000011111100111111001111110011111100111111001111110011111110100111101110110101101110110001111101000011111100111111001111110011111100111111001111110011111110100111101110110101101101011110 b1f43f3f3f3f3f3f3fa7bb5bb1f43f3f3f3f3f3f3fa7bb5b5e
UTF-8 鉛뗧룉遼사뎸悅덆Щ[鉛뗧룉遼사뎸悅덆Щ[^ 11101001100010011001101111101011100101111010011111101011101000111000100111101111101001111000001111101100100000101010110011101011100011101011100011100110100000101000010111101011100011011000011011010000101010010101101111101001100010011001101111101011100101111010011111101011101000111000100111101111101001111000001111101100100000101010110011101011100011101011100011100110100000101000010111101011100011011000011011010000101010010101101101011110 e9899beb97a7eba389efa783ec82aceb8eb8e68285eb8d86d0a95be9899beb97a7eba389efa783ec82aceb8eb8e68285eb8d86d0a95b5e
UHC 鉛뗧룉遼사뎸悅덆Щ[鉛뗧룉遼사뎸悅덆Щ[^ 111001101110011110001011111001111000111110001000111010011010110010111011111001111000100110001011111001101110110110001000111010011010110010111011010110111110011011100111100010111110011110001111100010001110100110101100101110111110011110001001100010111110011011101101100010001110100110101100101110110101101101011110 e6e78be78f88e9acbbe7898be6ed88e9acbb5be6e78be78f88e9acbbe7898be6ed88e9acbb5b5e

SJIS-Win, EUC-JP: Legacy character sets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP). A character that a charset cannot represent is shown as "?" (0x3f).
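
The conversion in the table above can be roughly reproduced with a short Python sketch. It is an illustration only, not the code behind this page. The function name encode_table and the mapping from the table's charset labels to Python codec names (cp932 for SJIS-WIN, cp949 for UHC) are assumptions, and Python's codecs may differ from this page for some vendor-specific characters. Unencodable characters are replaced with "?", which matches the 3f bytes seen in the table.

```python
# Assumed mapping from the table's charset labels to Python codec names.
CHARSETS = {
    "ISO-8859-1": "latin-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_table(text, charsets=CHARSETS):
    """Return (label, binary string, hex string) for each charset."""
    rows = []
    for label, codec in charsets.items():
        # errors="replace" substitutes "?" (0x3f) for unencodable characters.
        data = text.encode(codec, errors="replace")
        bits = "".join(f"{byte:08b}" for byte in data)
        rows.append((label, bits, data.hex()))
    return rows


if __name__ == "__main__":
    for label, bits, hexstr in encode_table("鉛^"):
        print(label, bits, hexstr)
```

For the input "鉛^", the UTF-8 row comes out as e9899b5e and the ISO-8859-1 row as 3f5e, since "鉛" has no ISO-8859-1 representation.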