To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 輿??遊??淫??語?????濡ろ?筌λ?溢 10010111011000000011111100111111100101110101011000111111001111111000100011111010001111110011111110001100111010100011111100111111001111110011111100111111100101000100011110000010111010110011111111100010101000111000001111001001001111111000100011101100 97603f3f97563f3f88fa3f3f8cea3f3f3f3f3f944782eb3fe2a383c93f88ec
EUC-JP 輿??遊??淫??語?????濡ろ?筌λ?溢 11001101110000010011111100111111110011011011011100111111001111111011000011111100001111110011111110111000111011000011111100111111001111110011111100111111110001111010100010100100111011010011111111100100101001011010011011001011001111111011000011101110 cdc13f3fcdb73f3fb0fc3f3fb8ec3f3f3f3f3fc7a8a4ed3fe4a5a6cb3fb0ee
UTF-8 輿삳뿫遊쏙쭒淫뗫뀆語ⓦ꺂溜쀧춯濡ろ돪筌λ맟溢 1110100010111100101111111110110010000010101100111110101110111111101010111110100110000001100010101110110010001111100110011110110010101101100100101110011010110111101010111110101110010111101010111110101110000000100001101110100010101010100111101110001010010011101001101110101010111010100000101110111110100111100010111110110010000000101001111110110010110110101011111110011010111111101000011110001110000010100011011110101110001111101010101110011110101101100011001100111010111011111010111010011110011111111001101011101010100010 e8bcbfec82b3ebbfabe9818aec8f99ecad92e6b7abeb97abeb8086e8aa9ee293a6eaba82efa78bec80a7ecb6afe6bfa1e3828deb8faae7ad8ccebbeba79fe6baa2
UHC 輿삳뿫遊쏙쭒淫뗫뀆語ⓦ꺂溜쀧춯濡ろ돪筌λ맟溢 1110011010101011101110111110101110010111101010111110101110110100101111011110111110100111100010101110101111100010100010111110101110000101100000101110010111011110101010001110001110000011101010111110101011111110100101111110011110101101100011001110101110100001101010101110110110001001101011011110111110100111101001011110101110010000101011001110110011101110 e6abbbeb97abebb4bdefa78aebe28beb8582e5dea8e383abeafe97e7ad8ceba1aaed89adefa7a5eb90acecee

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)