What bit string is a character (or short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ?????????nD?????????nD^ 0011111100111111001111110011111100111111001111110011111100111111001111110110111001000100001111110011111100111111001111110011111100111111001111110011111100111111011011100100010001011110 3f3f3f3f3f3f3f3f3f6e443f3f3f3f3f3f3f3f3f6e445e
SJIS-WIN 永??飮??碎??nD永??飮??碎??nD^ 1000100101101001001111110011111110011111010110100011111100111111111000011110101000111111001111110110111001000100100010010110100100111111001111111001111101011010001111110011111111100001111010100011111100111111011011100100010001011110 89693f3f9f5a3f3fe1ea3f3f6e4489693f3f9f5a3f3fe1ea3f3f6e445e
EUC-JP 永??飮??碎??nD永??飮??碎??nD^ 1011000111001010001111110011111111011101101110110011111100111111111000101110110000111111001111110110111001000100101100011100101000111111001111111101110110111011001111110011111111100010111011000011111100111111011011100100010001011110 b1ca3f3fddbb3f3fe2ec3f3f6e44b1ca3f3fddbb3f3fe2ec3f3f6e445e
UTF-8 永띠쉻飮당뇾碎㏃벁nD永띠쉻飮당뇾碎㏃벁nD^ 1110011010110000101110001110101110011101101000001110110010001001101110111110100110100011101011101110101110001011101110011110101110000111101111101110011110100010100011101110001110001111100000111110101110110010100000010110111001000100111001101011000010111000111010111001110110100000111011001000100110111011111010011010001110101110111010111000101110111001111010111000011110111110111001111010001010001110111000111000111110000011111010111011001010000001011011100100010001011110 e6b0b8eb9da0ec89bbe9a3aeeb8bb9eb87bee7a28ee38f83ebb2816e44e6b0b8eb9da0ec89bbe9a3aeeb8bb9eb87bee7a28ee38f83ebb2816e445e
UHC 永띠쉻飮당뇾碎㏃벁nD永띠쉻飮당뇾碎㏃벁nD^ 1110011110110101101101101110110010011010100100011110101111100110101101001110011110000111100111111110000111101111101001111110110010010011101001110110111001000100111001111011010110110110111011001001101010010001111010111110011010110100111001111000011110011111111000011110111110100111111011001001001110100111011011100100010001011110 e7b5b6ec9a91ebe6b4e7879fe1efa7ec93a76e44e7b5b6ec9a91ebe6b4e7879fe1efa7ec93a76e445e

SJIS-Win, EUC-JP: Legacy charsets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP). Characters that a charset cannot represent appear as "?" (hexadecimal 3f) in the table.
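
As a rough illustration of how such a table can be produced, here is a minimal Python sketch. It is not the code behind this page. The codec names (`cp932` for SJIS-Win, `cp949` for UHC) are Python's nearest equivalents and are assumptions, as are the helper names `encode_row` and `encode_table`. Characters a charset cannot represent are replaced with "?" (3f), which matches the behavior visible in the table.

```python
# Mapping from the labels used in the table to Python codec names.
# These codec choices are assumptions, not taken from the page itself.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Return (binary string, hex string) for text encoded with codec."""
    # errors="replace" substitutes "?" for characters the codec cannot encode.
    raw = text.encode(codec, errors="replace")
    bits = "".join(f"{byte:08b}" for byte in raw)
    return bits, raw.hex()


def encode_table(text):
    """Encode text in every charset listed in CHARSETS."""
    return {label: encode_row(text, codec) for label, codec in CHARSETS.items()}


if __name__ == "__main__":
    for label, (bits, hex_string) in encode_table("永nD").items():
        print(label, bits, hex_string)
```

Each byte is rendered as eight binary digits and two hexadecimal digits, so the binary column is always four times as long as the hexadecimal column.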