To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????????????????^ 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN 鶯?????節▼?嵬??節η????昻??^ 1110100111110010001111110011111100111111001111110011111110010000110111111000000110100101001111111001101111001010001111110011111110010000110111111000001111000101001111110011111100111111001111111111101011010000001111110011111101011110 e9f23f3f3f3f3f90df81a53f9bca3f3f90df83c53f3f3f3ffad03f3f5e
EUC-JP 鶯?????節▼?嵬??節η?渶?????^ 111100101111010000111111001111110011111100111111001111111100000011100001101000101010011100111111110101101100110000111111001111111100000011100001101001101100011100111111100011111100011111101101001111110011111100111111001111110011111101011110 f2f43f3f3f3f3fc0e1a2a73fd6cc3f3fc0e1a6c73f8fc7ed3f3f3f3f3f5e
UTF-8 鶯뚳쉈練쇘퇅節▼떧嵬뚦즺節η겮渶뽳숯昻븀겮^ 111010011011011010101111111010111001101010110011111011001000100110001000111011111010011010010110111011001000011110011000111011011000011110000101111001111010111110000000111000101001011010111100111010111001011010100111111001011011010110101100111010111001101010100110111011001010011010111010111001111010111110000000110011101011011111101010101100101010111011100110101110001011011011101011101111011011001111101100100010001010111111100110100110001011101111101011101110001000000011101010101100101010111001011110 e9b6afeb9ab3ec8988efa696ec8798ed8785e7af80e296bceb96a7e5b5aceb9aa6eca6bae7af80ceb7eab2aee6b8b6ebbdb3ec88afe698bbebb880eab2ae5e
UHC 鶯뚳쉈練쇘퇅節▼떧嵬뚦즺節η겮渶뽳숯昻븀겮^ 11100101101000111000110011101111101111011010010111100110110111111011110011100111101101111001011011101111101111011010000111100101100010111011101011101000111000111000110011100101101000111000110011101111101111011010010111100111100000011011110011100111101101111001011011101111101111011010000111100100111010011011101011100111100000011011110001011110 e5a38cefbda5e6dfbce7b796efbda1e58bbae8e38ce5a38cefbda5e781bce7b796efbda1e4e9bae781bc5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)