What bit string is a character (or a short string of characters) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ?????????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 昻??裕??遺??????や?諛??億?‥異 111110101101000000111111001111111001011101010100001111110011111110001000111000100011111100111111001111110011111100111111001111111000001011100010001111111110011010000111001111110011111110001001101011010011111110000001011001001000100011011001 fad03f3f97543f3f88e23f3f3f3f3f3f82e23fe6873f3f89ad3f816488d9
EUC-JP ???裕??遺??????や?諛??億?‥異 0011111100111111001111111100110110110101001111110011111110110000111001000011111100111111001111110011111100111111001111111010010011100100001111111110101111100111001111110011111110110010101011110011111110100001110001011011000011011011 3f3f3fcdb53f3fb0e43f3f3f3f3f3fa4e43febe73f3fb2af3fa1c5b0db
UTF-8 昻뉗떝裕꾡슭遺살쭍列욧퍔璘や벧諛몄구億됰‥異 111001101001100010111011111010111000100110010111111010111001011010011101111010001010001110010101111010101011111010100001111011001000101010101101111010011000000110111010111011001000001010110100111011001010110110001101111011111010011010011100111011001001101010100111111011011000110110010100111011111010011110101111111000111000001010000100111010111011001010100111111010001010101110011011111010111010101010000100111010101011010110101100111001011000010010000100111010111001000010110000111000101000000010100101111001111001010110110000 e698bbeb8997eb969de8a395eabea1ec8aade981baec82b4ecad8defa69cec9aa7ed8d94efa7afe38284ebb2a7e8ab9bebaa84eab5ace58484eb90b0e280a5e795b0
UHC 昻뉗떝裕꾡슭遺살쭍列욧퍔璘や벧諛몄구億됰‥異 1110010011101001100001111110110010001011101100111110101110101110100001001110010010111101101111101110101110110110101110111110110010100111100001101110011011101010101111111110101010111011100010111110110011011110101010101110010010111010101001101110101110110000101110001110110010110001101110001110010111100010100010011110101110100001101001011110110010110110 e4e987ec8bb3ebae84e4bdbeebb6bbeca786e6eabfeabb8becdeaae4baa6ebb0b8ecb1b8e5e289eba1a5ecb6
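A conversion like the one in the table can be approximated offline. The following is a minimal sketch, not the tool's actual implementation. It assumes that Python's codecs `cp932`, `euc_jp`, and `cp949` stand in for SJIS-WIN, EUC-JP, and UHC, and that unencodable characters are replaced with `?` (0x3f), which is consistent with the ISO-8859-1 row above. The names `CHARSETS` and `encode_in_charsets` are hypothetical.

```python
# Assumed mapping from the table's charset labels to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_in_charsets(text):
    """Return {charset label: (binary string, hex string)} for text."""
    rows = {}
    for label, codec in CHARSETS.items():
        # errors="replace" substitutes "?" for characters the charset lacks.
        raw = text.encode(codec, errors="replace")
        bits = "".join(f"{byte:08b}" for byte in raw)
        rows[label] = (bits, raw.hex())
    return rows


if __name__ == "__main__":
    for label, (bits, hex_string) in encode_in_charsets("や").items():
        print(label, bits, hex_string)
```

Each byte is rendered as eight binary digits, so the binary column is always four times as long as the hexadecimal column.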

SJIS-Win, EUC-JP: Classic charsets used mainly for Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
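To illustrate that these charsets assign different byte sequences to the same character, the sketch below decodes the byte sequences that the table lists for "や" (82e2 in SJIS-WIN, a4e4 in EUC-JP, e38284 in UTF-8). It assumes Python's `cp932` and `euc_jp` codecs correspond to SJIS-Win and EUC-JP; the helper name `decode_hex` is hypothetical.

```python
def decode_hex(hex_string, codec):
    """Decode a hexadecimal byte string using the given Python codec."""
    return bytes.fromhex(hex_string).decode(codec)


if __name__ == "__main__":
    # Byte sequences for the same character, taken from the table rows.
    samples = [("82e2", "cp932"), ("a4e4", "euc_jp"), ("e38284", "utf-8")]
    for hex_string, codec in samples:
        print(codec, hex_string, decode_hex(hex_string, codec))
```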