To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN ????▼?諭??巍ル?二??遺??沃 0011111100111111001111110011111110000001101001010011111110010111010000000011111100111111100110111101100110000011100010110011111110010011111100010011111100111111100010001110001000111111001111111001011110000000 3f3f3f3f81a53f97403f3f9bd9838b3f93f13f3f88e23f3f9780
EUC-JP ???沅▼?諭??巍ル?二??遺??沃 00111111001111110011111110001111110001101110100110100010101001110011111111001101101000010011111100111111110101101101101110100101111010110011111111000110111100110011111100111111101100001110010000111111001111111100110111100000 3f3f3f8fc6e9a2a73fcda13f3fd6dba5eb3fc6f33f3fb0e43f3fcde0
UTF-8 嶺뚮뿭沅▼츦諭꾩춷巍ル봿二녘몛遺얘턁沃 111011111010011010101011111010111001101010101110111010111011111110101101111001101011001010000101111000101001011010111100111011001011100010100110111010001010101110101101111010101011111010101001111011001011011010110111111001011011011110001101111000111000001110101011111010111011010010111111111001001011101010001100111010111000010110011000111010111010101010011011111010011000000110111010111011001001011010011000111011011000010010000001111001101011001010000011 efa6abeb9aaeebbfade6b285e296bcecb8a6e8abadeabea9ecb6b7e5b78de383abebb4bfe4ba8ceb8598ebaa9be981baec9698ed8481e6b283
UHC 嶺뚮뿭沅▼츦諭꾩춷巍ル봿二녘몛遺얘턁沃 1110011110101101100011001110101110010111101011011110101010110110101000011110010110101110100111001110101110110001100001001110110010101101100100111110100011100100101010111110101110010100100001101110110010100011101100111110100010010001100010011110101110110110101111101110101010110101100111011110100010101010 e7ad8ceb97adeab6a1e5ae9cebb184ecad93e8e4abeb9486eca3b3e89189ebb6beeab59de8aa

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)