To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????????????????? 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN ???維??循?????踰?┼???壹?? 0011111100111111001111111000100011011011001111110011111110001111011110100011111100111111001111110011111100111111111001101111101000111111100001001010100100111111001111110011111110011010111000110011111100111111 3f3f3f88db3f3f8f7a3f3f3f3f3fe6fa3f84a93f3f3f9ae33f3f
EUC-JP ???維??循?????踰?┼洧??壹?? 00111111001111110011111110110000110111010011111100111111101111011101101100111111001111110011111100111111001111111110110011111100001111111010100010101011100011111100011110110100001111110011111111010100111001010011111100111111 3f3f3fb0dd3f3fbddb3f3f3f3f3fecfc3fa8ab8fc7b43f3fd4e53f3f
UTF-8 捻꿎뱿維뽪끽循녿겱醴븐슧踰뽳┼洧쏅맧壹듕틛 111011111010011010100100111010101011111110001110111010111011000110111111111001111011011010101101111010111011110110101010111010111000000110111101111001011011111010101010111010111000010110111111111010101011001010110001111011111010011010110111111010111011100010010000111011001000101010100111111010001011100010110000111010111011110110110011111000101001010010111100111001101011010010100111111011001000111110000101111010111010011110100111111001011010001110111001111010111001001110010101111011011000101110011011 efa6a4eabf8eebb1bfe7b6adebbdaaeb81bde5beaaeb85bfeab2b1efa6b7ebb890ec8aa7e8b8b0ebbdb3e294bce6b4a7ec8f85eba7a7e5a3b9eb9395ed8b9b
UHC 捻꿎뱿維뽪끽循녿겱醴븐슧踰뽳┼洧쏅맧壹듕틛 111001101111011110110010111000101001001110100101111010111010101110010110111001101011001110100011111000101110000010000110111010111000000110111101111001111110010010111010111011001001101010110001111010111011001010010110111011111010011010101011111010101111101110011011111010111001000010110000111011001110110010110101111001001011101010001000 e6f7b2e293a5ebab96e6b3a3e2e086eb81bde7e4baec9ab1ebb296efa6abeafb9beb90b0ececb5e4ba88

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)