To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????????????????? 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 癲?????銀?????宥?┏遺??筌?? 111000011001111100111111001111110011111100111111001111111000101111100010001111110011111100111111001111110011111110010111010001110011111110000100101011001000100011100010001111110011111111100010101000110011111100111111 e19f3f3f3f3f3f8be23f3f3f3f3f97473f84ac88e23f3fe2a33f3f
EUC-JP 癲?????銀??孼??宥?┏遺??筌?? 1110001010100001001111110011111100111111001111110011111110110110111001000011111100111111100011111011101011000011001111110011111111001101101010000011111110101000101011101011000011100100001111110011111111100100101001010011111100111111 e2a13f3f3f3f3fb6e43f3f8fbac33f3fcda83fa8aeb0e43f3fe4a53f3f
UTF-8 癲삳끃六쇘뙴銀㏓퉲孼꾬퐢宥길┏遺얜굦筌덉쾿 111001111001100110110010111011001000001010110011111010111000000110000011111011111010011110010001111011001000011110011000111010111001100110110100111010011000101010000000111000111000111110010011111011011000100110110010111001011010110110111100111010101011111010101100111011011001000010100010111001011010111010100101111010101011100010111000111000101001010010001111111010011000000110111010111011001001011010011100111010101011010110100110111001111010110110001100111010111000110110001001111011001011111010111111 e799b2ec82b3eb8183efa791ec8798eb99b4e98a80e38f93ed89b2e5adbceabeaced90a2e5aea5eab8b8e2948fe981baec969ceab5a6e7ad8ceb8d89ecbebf
UHC 癲삳끃六쇘뙴銀㏓퉲孼꾬퐢宥길┏遺얜굦筌덉쾿 111011111010011010111011111010111000010110111001111010111011101110111100111001111000110010110111111010111101111010100111111010111011100110001010111001011110110110000100111011111011110110001011111010101110100110110001111001101010011010101110111010111011011010111110111010111000001010001100111011111010011110001000111011001011001010010101 efa6bbeb85b9ebbbbce78cb7ebdea7ebb98ae5ed84efbd8beae9b1e6a6aeebb6beeb828cefa788ecb295

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)