What bit string is a character (or string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ??????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 沃?????淫??汚??違??恂ル?壓 1001011110000000001111110011111100111111001111110011111110001000111110100011111100111111100010011001100000111111001111111000100011100001001111110011111110011100100101101000001110001011001111111001101011011000 97803f3f3f3f3f88fa3f3f89983f3f88e13f3f9c96838b3f9ad8
EUC-JP 沃??堉??淫??汚??違??恂ル?壓 11001101111000000011111100111111100011111011011111111101001111110011111110110000111111000011111100111111101100011111100000111111001111111011000011100011001111110011111111010111111101101010010111101011001111111101010011011010 cde03f3f8fb7fd3f3fb0fc3f3fb1f83f3fb0e33f3fd7f6a5eb3fd4da
UTF-8 沃욌쪇堉붷렘淫뗪턀汚삳슔違뗰쫳恂ル늉壓 111001101011001010000011111011001001101010001100111011001010101010000111111001011010000010001001111010111011011010110111111010111010000010011000111001101011011110101011111010111001011110101010111011011000010010000000111001101011000110011010111011001000001010110011111011001000101010010100111010011000000110010101111010111001011110110000111011001010101110110011111001101000000110000010111000111000001110101011111010111000101010001001111001011010001110010011 e6b283ec9a8cecaa87e5a089ebb6b7eba098e6b7abeb97aaed8480e6b19aec82b3ec8a94e98195eb97b0ecabb3e68182e383abeb8a89e5a393
UHC 沃욌쪇堉붷렘淫뗪턀汚삳슔違뗰쫳恂ル늉壓 1110100010101010100111101110101110100101100000011110101110111100100101001110010110110111101111011110101111100010100010111110101010110101100111001110011111111101101110111110101110011010101000111110101011011110100010111110111110100110100010111110001011100001101010111110101110110100101111111110010011100010 e8aa9eeba581ebbc94e5b7bdebe28beab59ce7fdbbeb9aa3eade8befa68be2e1abebb4bfe4e2

SJIS-Win, EUC-JP: Legacy character sets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
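
The conversion shown in the table can be approximated with a short Python sketch. This is not the converter's actual implementation: the function name `encode_to_bits` and the mapping from the table's charset labels to Python codec names (`latin-1`, `cp932`, `euc_jp`, `utf-8`, `cp949`) are assumptions made for illustration, and Python's conversion tables may differ from the tool's for some characters. Characters that a charset cannot represent are replaced with "?" (0x3f), which matches the runs of 3f in the ISO-8859-1 row.

```python
# Assumed mapping from the table's charset labels to Python codec names.
CHARSETS = {
    "ISO-8859-1": "latin-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_to_bits(text, codec):
    """Return (binary string, hex string) for text encoded with codec.

    Unencodable characters become '?' (byte 0x3f).
    """
    data = text.encode(codec, errors="replace")
    bits = "".join(f"{byte:08b}" for byte in data)  # 8 bits per byte
    return bits, data.hex()


if __name__ == "__main__":
    sample = "ル"  # one of the characters that appears in the table
    for label, codec in CHARSETS.items():
        bits, hexstr = encode_to_bits(sample, codec)
        print(label, bits, hexstr)
```

For the single character "ル", this yields e383ab in UTF-8, 838b in CP932, and a5eb in EUC-JP, the same byte sequences that appear for that character in the rows above.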