What bit string is a character (or a sequence of characters) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ????????????????????^ 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN 咀坎???咀坎?縫?咀坎???咀坎?縫?^ 10011001111100001001101010101010001111110011111100111111100110011111000010011010101010100011111110010110010001000011111110011001111100001001101010101010001111110011111100111111100110011111000010011010101010100011111110010110010001000011111101011110 99f09aaa3f3f3f99f09aaa3f96443f99f09aaa3f3f3f99f09aaa3f96443f5e
EUC-JP 咀坎?琫?咀坎?縫?咀坎?琫?咀坎?縫?^ 1101001011110010110101001010110000111111100011111100110010101111001111111101001011110010110101001010110000111111110010111010010100111111110100101111001011010100101011000011111110001111110011001010111100111111110100101111001011010100101011000011111111001011101001010011111101011110 d2f2d4ac3f8fccaf3fd2f2d4ac3fcba53fd2f2d4ac3f8fccaf3fd2f2d4ac3fcba53f5e
UTF-8 咀坎렩琫썹咀坎렩縫진咀坎렩琫썹咀坎렩縫진^ 11100101100100101000000011100101100111011000111011101011101000001010100111100111100100001010101111101100100011011011100111100101100100101000000011100101100111011000111011101011101000001010100111100111101110001010101111101100101001111000010011100101100100101000000011100101100111011000111011101011101000001010100111100111100100001010101111101100100011011011100111100101100100101000000011100101100111011000111011101011101000001010100111100111101110001010101111101100101001111000010001011110 e59280e59d8eeba0a9e790abec8db9e59280e59d8eeba0a9e7b8abeca784e59280e59d8eeba0a9e790abec8db9e59280e59d8eeba0a9e7b8abeca7845e
UHC 咀坎렩琫썹咀坎렩縫진咀坎렩琫썹咀坎렩縫진^ 1110111010111010110010101110110010001110101101111101110011101101101111011110011111101110101110101100101011101100100011101011011111011100111011101100000111111000111011101011101011001010111011001000111010110111110111001110110110111101111001111110111010111010110010101110110010001110101101111101110011101110110000011111100001011110 eebacaec8eb7dcedbde7eebacaec8eb7dceec1f8eebacaec8eb7dcedbde7eebacaec8eb7dceec1f85e

SJIS-Win, EUC-JP: Legacy character sets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
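
The conversion shown in the table can be approximated offline. The following is a minimal Python sketch, not the code behind this page: the mapping from the page's charset labels to Python codec names (`cp932` for SJIS-WIN, `cp949` for UHC, and so on) and the function name `bit_strings` are assumptions made for illustration. Characters that a charset cannot represent are replaced with `?` (0x3f), which appears to be what the table does. Python's codec tables may differ from the converter's in some edge cases.

```python
# Hypothetical mapping from the page's charset labels to Python codec names.
# These pairings are an assumption; the converter itself may use other tables.
CODECS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def bit_strings(text):
    """Return {charset label: (binary string, hex string)} for the given text."""
    rows = {}
    for label, codec in CODECS.items():
        # errors="replace" substitutes "?" for characters the charset lacks.
        raw = text.encode(codec, errors="replace")
        binary = "".join(f"{byte:08b}" for byte in raw)
        rows[label] = (binary, raw.hex())
    return rows


if __name__ == "__main__":
    for label, (binary, hexadecimal) in bit_strings("^").items():
        print(label, binary, hexadecimal)
```

For an ASCII character such as `^`, every charset listed here yields the same single byte. The outputs start to differ once the input contains characters outside ASCII.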