What bit string is a character (or short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ????????????????????? 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 昻??裕??儒?┛???猿??瑤??喩?┐ 1111101011010000001111110011111110010111010101000011111100111111100011101111001000111111100001001010111000111111001111110011111110001001100011100011111100111111111010101010001000111111001111111001101001100111001111111000010010100010 fad03f3f97543f3f8ef23f84ae3f3f3f898e3f3feaa23f3f9a673f84a2
EUC-JP ???裕??儒?┛???猿??瑤??喩?┐ 00111111001111110011111111001101101101010011111100111111101111001111010000111111101010001011000000111111001111110011111110110001111011100011111100111111111101001010010000111111001111111101001111001000001111111010100010100100 3f3f3fcdb53f3fbcf43fa8b03f3f3fb1ee3f3ff4a43f3fd3c83fa8a4
UTF-8 昻뉗떝裕뉒뙴儒삳┛裂섓쭔猿놁쪠瑤녠쑬喩쇽┐ 111001101001100010111011111010111000100110010111111010111001011010011101111010001010001110010101111010111000100110010010111010111001100110110100111001011000010010010010111011001000001010110011111000101001010010011011111011111010011010100000111011001000010010010011111011001010110110010100111001111000110010111111111010111000011010000001111011001010101010100000111001111001000110100100111010111000010110100000111011001001000110101100111001011001011010101001111011001000011110111101111000101001010010010000 e698bbeb8997eb969de8a395eb8992eb99b4e58492ec82b3e2949befa6a0ec8493ecad94e78cbfeb8681ecaaa0e791a4eb85a0ec91ace596a9ec87bde29490
UHC 昻뉗떝裕뉒뙴儒삳┛裂섓쭔猿놁쪠瑤녠쑬喩쇽┐ 111001001110100110000111111011001000101110110011111010111010111010000111111001111000110010110111111010101110001110111011111010111010011010110000111001101111000110011000111011111010011110001100111010101011101110000110111011001010010110011001111010001111110110110011111010101011111010101000111010101110011110111100111011111010011010100100 e4e987ec8bb3ebae87e78cb7eae3bbeba6b0e6f198efa78ceabb86eca599e8fdb3eabea8eae7bcefa6a4

SJIS-Win, EUC-JP: Legacy character sets used mainly to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
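
The conversion in the table can be approximated offline. The following is a minimal Python sketch, not the code behind this page. The codec names (`cp932` for SJIS-Win, `cp949` for UHC, and so on) and the helper name `encode_in_charsets` are assumptions chosen for illustration. As in the table, a character that a charset cannot represent is replaced by "?" (0x3f). Python's `euc_jp` codec may cover some characters that this page reports as "?", so results for rare characters can differ.

```python
# Assumed mapping from the table's charset labels to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-Win": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_in_charsets(text):
    """Return {charset label: (binary string, hex string)} for text.

    Characters a charset cannot represent are replaced with '?' (0x3f).
    """
    rows = {}
    for label, codec in CHARSETS.items():
        raw = text.encode(codec, errors="replace")
        bits = "".join(f"{byte:08b}" for byte in raw)  # 8 bits per byte
        rows[label] = (bits, raw.hex())
    return rows


if __name__ == "__main__":
    for label, (bits, hex_string) in encode_in_charsets("裕").items():
        print(label, bits, hex_string)
```

For the single character 裕, which appears in the table above, this yields 9754 for SJIS-Win, cdb5 for EUC-JP, and e8a395 for UTF-8, matching the corresponding bytes in the table rows.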