To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 永??宥????─吟??揖η?爰⑤?泣 10001001011010010011111100111111100101110100011100111111001111110011111100111111100001001001111110001011111000010011111100111111100101110100101110000011110001010011111111100000101001111000011101000100001111111000101110000011 89693f3f97473f3f3f3f849f8be13f3f974b83c53fe0a787443f8b83
EUC-JP 永??宥????─吟??揖η?爰??泣 101100011100101000111111001111111100110110101000001111110011111100111111001111111010100010100001101101101110001100111111001111111100110110101100101001101100011100111111111000001010100100111111001111111011010111100011 b1ca3f3fcda83f3f3f3fa8a1b6e33f3fcdaca6c73fe0a93f3fb5e3
UTF-8 永띔벰宥룐뵖練뚳─吟⑸뙋揖η독爰⑤뙋泣 1110011010110000101110001110101110011101100101001110101110110010101100001110010110101110101001011110101110100011100100001110101110110101100101101110111110100110100101101110101110011010101100111110001010010100100000001110010110010000100111111110001010010001101110001110101110011001100010111110011010001111100101101100111010110111111010111000111110000101111001111000100010110000111000101001000110100100111010111001100110001011111001101011001110100011 e6b0b8eb9d94ebb2b0e5aea5eba390ebb596efa696eb9ab3e29480e5909fe291b8eb998be68f96ceb7eb8f85e788b0e291a4eb998be6b3a3
UHC 永띔벰宥룐뵖練뚳─吟⑸뙋揖η독爰⑤뙋泣 1110011110110101101101101110101010111010101010001110101011101001101101111110001010010100100110001110011011011111100011001110111110100110101000011110101111100001101010011110101110001100100100001110101111100111101001011110011110110101101101101110101010111010101010001110101110001100100100001110101111101000 e7b5b6eabaa8eae9b7e29498e6df8cefa6a1ebe1a9eb8c90ebe7a5e7b5b6eabaa8eb8c90ebe8

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)