To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????????????????? 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 癲?8柚??宥??沃??日??貫?椰??? 1110000110011111001111111000001001010111100101110100110100111111001111111001011101000111001111110011111110010111100000000011111100111111100100111111101000111111001111111000101011010001001111111001111010111101001111110011111100111111 e19f3f8257974d3f3f97473f3f97803f3f93fa3f3f8ad13f9ebd3f3f3f
EUC-JP 癲?8柚??宥??沃??日??貫?椰??? 1110001010100001001111111010001110111000110011011010111000111111001111111100110110101000001111110011111111001101111000000011111100111111110001101111110000111111001111111011010011010011001111111101110010111111001111110011111100111111 e2a13fa3b8cdae3f3fcda83f3fcde03f3fc6fc3f3fb4d33fdcbf3f3f3f
UTF-8 癲쒕8柚삯뜮宥룻뫛沃섃뫜日딉쬅貫캉椰꾟몺柳 111001111001100110110010111011001001001010010101111011111011110010011000111001101001111110011010111011001000001010101111111010111001110010101110111001011010111010100101111010111010001110111011111010111010101110011011111001101011001010000011111011001000010010000011111010111010101110011100111001101001011110100101111010111001010010001001111011001010110010000101111010001011001010101011111011001011101010001001111001101010010010110000111010101011111010011111111010111010101010111010111011111010011110001001 e799b2ec9295efbc98e69f9aec82afeb9caee5aea5eba3bbebab9be6b283ec8483ebab9ce697a5eb9489ecac85e8b2abecba89e6a4b0eabe9febaabaefa789
UHC 癲쒕8柚삯뜮宥룻뫛沃섃뫜日딉쬅貫캉椰꾟몺柳 111011111010011010011100111010111010001110111000111010101111011010111011111010011000110110101110111010101110100110110111111011011001000110111011111010001010101010011000111000101001000110111100111011001110110110001010111011111010011010011100110011101011101111000100101100101110010110101011100001001110001010010001101000001110101011110111 efa69ceba3b8eaf6bbe98daeeae9b7ed91bbe8aa98e291bceced8aefa69ccebbc4b2e5ab84e291a0eaf7

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)