What bit string is a character (or a short string) encoded to in each character set?

Enter one character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ?????????W}?????????W{^ 0011111100111111001111110011111100111111001111110011111100111111001111110101011101111101001111110011111100111111001111110011111100111111001111110011111100111111010101110111101101011110 3f3f3f3f3f3f3f3f3f577d3f3f3f3f3f3f3f3f3f577b5e
SJIS-WIN 狎?????淹??W}狎?????淹??W{^ 111000001011111000111111001111110011111100111111001111111001111110111001001111110011111101010111011111011110000010111110001111110011111100111111001111110011111110011111101110010011111100111111010101110111101101011110 e0be3f3f3f3f3f9fb93f3f577de0be3f3f3f3f3f9fb93f3f577b5e
EUC-JP 狎?????淹??W}狎?????淹??W{^ 111000001100000000111111001111110011111100111111001111111101111010111011001111110011111101010111011111011110000011000000001111110011111100111111001111110011111111011110101110110011111100111111010101110111101101011110 e0c03f3f3f3f3fdebb3f3f577de0c03f3f3f3f3fdebb3f3f577b5e
UTF-8 狎숅뀒廉뉛쉥淹쒑춼W}狎숅뀒廉뉛쉥淹쒑춼W{^ 1110011110001011100011101110110010001000100001011110101110000000100100101110111110100110101000101110101110001001100110111110110010001001101001011110011010110111101110011110110010010010100100011110110010110110101111000101011101111101111001111000101110001110111011001000100010000101111010111000000010010010111011111010011010100010111010111000100110011011111011001000100110100101111001101011011110111001111011001001001010010001111011001011011010111100010101110111101101011110 e78b8eec8885eb8092efa6a2eb899bec89a5e6b7b9ec9291ecb6bc577de78b8eec8885eb8092efa6a2eb899bec89a5e6b7b9ec9291ecb6bc577b5e
UHC 狎숅뀒廉뉛쉥淹쒑춼W}狎숅뀒廉뉛쉥淹쒑춼W{^ 1110010011100100100110011110100110000101100011001110011011110101100001111110111110111101101010111110010111110100100111001110100010101101100110000101011101111101111001001110010010011001111010011000010110001100111001101111010110000111111011111011110110101011111001011111010010011100111010001010110110011000010101110111101101011110 e4e499e9858ce6f587efbdabe5f49ce8ad98577de4e499e9858ce6f587efbdabe5f49ce8ad98577b5e

SJIS-Win, EUC-JP: Classic character sets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
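
The conversion shown in the table can be approximated offline. The following is a minimal sketch in Python, not the converter's actual implementation. The codec names (`cp932` for SJIS-WIN, `cp949` for UHC, and so on), the `CHARSETS` mapping, and the `encode_table` helper are assumptions chosen for illustration. Characters that a charset cannot represent are replaced with `?` (byte `3f`), which appears to match the behavior in the rows above.

```python
# Hypothetical mapping from the table's charset labels to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",   # SJIS-Win = CP932, per the note above
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",        # assumption: UHC corresponds to Python's cp949 codec
}


def encode_table(text):
    """Return (charset, shown text, binary string, hex string) for each charset."""
    rows = []
    for label, codec in CHARSETS.items():
        # Unencodable characters become "?" instead of raising an error.
        raw = text.encode(codec, errors="replace")
        shown = raw.decode(codec, errors="replace")
        bits = "".join(f"{byte:08b}" for byte in raw)  # 8 bits per byte
        rows.append((label, shown, bits, raw.hex()))
    return rows


if __name__ == "__main__":
    for label, shown, bits, hexstr in encode_table("あ"):
        print(label, shown, bits, hexstr)
```

Each byte is rendered as eight binary digits and two hexadecimal digits, so the binary column is always four times as long as the hexadecimal column.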