To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????W}?????????W{^ 0011111100111111001111110011111100111111001111110011111100111111001111110101011101111101001111110011111100111111001111110011111100111111001111110011111100111111010101110111101101011110 3f3f3f3f3f3f3f3f3f577d3f3f3f3f3f3f3f3f3f577b5e
SJIS-WIN 馭?????碎??W}馭?????碎??W{^ 111010010110011000111111001111110011111100111111001111111110000111101010001111110011111101010111011111011110100101100110001111110011111100111111001111110011111111100001111010100011111100111111010101110111101101011110 e9663f3f3f3f3fe1ea3f3f577de9663f3f3f3f3fe1ea3f3f577b5e
EUC-JP 馭?????碎??W}馭?????碎??W{^ 111100011100011100111111001111110011111100111111001111111110001011101100001111110011111101010111011111011111000111000111001111110011111100111111001111110011111111100010111011000011111100111111010101110111101101011110 f1c73f3f3f3f3fe2ec3f3f577df1c73f3f3f3f3fe2ec3f3f577b5e
UTF-8 馭곣뫗李덃걖碎밤렍W}馭곣뫗李덃걖碎밤렍W{^ 1110100110100110101011011110101010110011101000111110101110101011100101111110111110100111101000011110101110001101100000111110101010110001100101101110011110100010100011101110101110110000101001001110101110100000100011010101011101111101111010011010011010101101111010101011001110100011111010111010101110010111111011111010011110100001111010111000110110000011111010101011000110010110111001111010001010001110111010111011000010100100111010111010000010001101010101110111101101011110 e9a6adeab3a3ebab97efa7a1eb8d83eab196e7a28eebb0a4eba08d577de9a6adeab3a3ebab97efa7a1eb8d83eab196e7a28eebb0a4eba08d577b5e
UHC 馭곣뫗李덃걖碎밤렍W}馭곣뫗李덃걖碎밤렍W{^ 1110010111011111100000011110001010010001101110011110110010110000100010001110011010000001100000011110000111101111101110011110001110001110101000110101011101111101111001011101111110000001111000101001000110111001111011001011000010001000111001101000000110000001111000011110111110111001111000111000111010100011010101110111101101011110 e5df81e291b9ecb088e68181e1efb9e38ea3577de5df81e291b9ecb088e68181e1efb9e38ea3577b5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)