To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 永??宥????─吟??泣η?爰⑤?飮 10001001011010010011111100111111100101110100011100111111001111110011111100111111100001001001111110001011111000010011111100111111100010111000001110000011110001010011111111100000101001111000011101000100001111111001111101011010 89693f3f97473f3f3f3f849f8be13f3f8b8383c53fe0a787443f9f5a
EUC-JP 永??宥????─吟??泣η?爰??飮 101100011100101000111111001111111100110110101000001111110011111100111111001111111010100010100001101101101110001100111111001111111011010111100011101001101100011100111111111000001010100100111111001111111101110110111011 b1ca3f3fcda83f3f3f3fa8a1b6e33f3fb5e3a6c73fe0a93f3fddbb
UTF-8 永띔벰宥살넗練뚳─吟⑸뙋泣η독爰⑤뙋飮 1110011010110000101110001110101110011101100101001110101110110010101100001110010110101110101001011110110010000010101101001110101110000100100101111110111110100110100101101110101110011010101100111110001010010100100000001110010110010000100111111110001010010001101110001110101110011001100010111110011010110011101000111100111010110111111010111000111110000101111001111000100010110000111000101001000110100100111010111001100110001011111010011010001110101110 e6b0b8eb9d94ebb2b0e5aea5ec82b4eb8497efa696eb9ab3e29480e5909fe291b8eb998be6b3a3ceb7eb8f85e788b0e291a4eb998be9a3ae
UHC 永띔벰宥살넗練뚳─吟⑸뙋泣η독爰⑤뙋飮 1110011110110101101101101110101010111010101010001110101011101001101110111110110010000110101000001110011011011111100011001110111110100110101000011110101111100001101010011110101110001100100100001110101111101000101001011110011110110101101101101110101010111010101010001110101110001100100100001110101111100110 e7b5b6eabaa8eae9bbec86a0e6df8cefa6a1ebe1a9eb8c90ebe8a5e7b5b6eabaa8eb8c90ebe6

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)