To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 渦????ⅹ倭??譽??鸚?オ???沃??岳 100010010101000100111111001111110011111100111111111110100100100110011000011000000011111100111111111001101010001100111111001111111110101001011111001111111000001101001001001111110011111100111111100101111000000000111111001111111000101001111000 89513f3f3f3ffa4998603f3fe6a33f3fea5f3f83493f3f3f97803f3f8a78
EUC-JP 渦??孼??倭??譽??鸚?オ???沃??岳 10110001101100100011111100111111100011111011101011000011001111110011111111001111110000010011111100111111111011001010010100111111001111111111001111000000001111111010010110101010001111110011111100111111110011011110000000111111001111111011001111011001 b1b23f3f8fbac33f3fcfc13f3feca53f3ff3c03fa5aa3f3f3fcde03f3fb3d9
UTF-8 渦욘씇孼껃ⅹ倭잒큹譽길겘鸚싩オ呂잒퀕沃곈걶岳 111001101011100010100110111011001001101010011000111011001001010010000111111001011010110110111100111010101011101110000011111000101000010110111001111001011000000010101101111011001001111010010010111011011000000110111001111010001010110110111101111010101011100010111000111010101011001010011000111010011011100010011010111011001000101110101001111000111000001010101010111011111010011010000000111011001001111010010010111011011000000010010101111001101011001010000011111010101011001110001000111010101011000110110110111001011011001010110011 e6b8a6ec9a98ec9487e5adbceabb83e285b9e580adec9e92ed81b9e8adbdeab8b8eab298e9b89aec8ba9e382aaefa680ec9e92ed8095e6b283eab388eab1b6e5b2b3
UHC 渦욘씇孼껃ⅹ倭잒큹譽길겘鸚싩オ呂잒퀕沃곈걶岳 1110100010111110101111111110011010011101100111111110010111101101100000111110010110100101101010101110100011011110100111111110100010110100100010001110011111100010101100011110011010000001101011111110010110100100100110101110011110101011101010101110010111111011100111111110100010110011100010101110100010101010101100001110100110000001100111001110010010111111 e8bebfe69d9fe5ed83e5a5aae8de9fe8b488e7e2b1e681afe5a49ae7abaae5fb9fe8b38ae8aab0e9819ce4bf

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)