What bit string is a character (or string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ?????????????????????^ 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN 絶??絶??嶸??章?????饒?????^ 100100001110001000111111001111111001000011100010001111110011111111111010101101000011111100111111100011111100110100111111001111110011111100111111001111111110100101100000001111110011111100111111001111110011111101011110 90e23f3f90e23f3ffab43f3f8fcd3f3f3f3f3fe9603f3f3f3f3f5e
EUC-JP 絶??絶??嶸??章??焰??饒?????^ 110000001110010000111111001111111100000011100100001111110011111110001111101110111111010000111111001111111011111011001111001111110011111110001111110010011110111100111111001111111111000111000001001111110011111100111111001111110011111101011110 c0e43f3fc0e43f3f8fbbf43f3fbecf3f3f8fc9ef3f3ff1c13f3f3f3f3f5e
UTF-8 絶쀧씮絶쏃츕嶸뤹컩章듸풓焰울풘饒뤹츕怜묔벘^ 11100111101101011011011011101100100000001010011111101100100101001010111011100111101101011011011011101100100011111000001111101100101110001001010111100101101101101011100011101011101001001011100111101100101110111010100111100111101010111010000011101011100100111011100011101101100100101001001111100111100001001011000011101100100110101011100011101101100100101001100011101001101001011001001011101011101001001011100111101100101110001001010111101111101001101010110011101011101011001001010011101011101100101001100001011110 e7b5b6ec80a7ec94aee7b5b6ec8f83ecb895e5b6b8eba4b9ecbba9e7aba0eb93b8ed9293e784b0ec9ab8ed9298e9a592eba4b9ecb895efa6acebac94ebb2985e
UHC 絶쀧씮絶쏃츕嶸뤹컩章듸풓焰울풘饒뤹츕怜묔벘^ 11101111101111101001011111100111100111011011111111101111101111101001101111101001101011101000111111100111101011101000111111100111101100001001000111101101111100011011010111101111101111101001011111100110111110111011111111101111101111101001101111101001101011101000111111100111101011101000111111100111101100001001000111101110100100111011010101011110 efbe97e79dbfefbe9be9ae8fe7ae8fe7b091edf1b5efbe97e6fbbfefbe9be9ae8fe7ae8fe7b091ee93b55e

SJIS-win, EUC-JP: Legacy character sets used mainly for Japanese text, on Windows (SJIS-win = CP932) and on UNIX (EUC-JP). Characters that a charset cannot represent appear as "?" (0x3f) in the table above.
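
The conversion shown in the table can be approximated offline. The following is a minimal Python sketch, not the tool's actual implementation. The mapping from the table's charset labels to Python codec names (cp932 for SJIS-win, cp949 for UHC) and the helper names encode_row and conversion_table are assumptions made for illustration. Unencodable characters are replaced with "?", which matches the 3f bytes seen in the table, though edge cases may differ between this sketch and the tool.

```python
# Assumed mapping from the table's charset labels to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-win": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Return (binary string, hex string) for text encoded with codec.

    Characters the codec cannot represent are replaced with '?' (0x3f).
    """
    raw = text.encode(codec, errors="replace")
    binary = "".join(f"{byte:08b}" for byte in raw)
    return binary, raw.hex()


def conversion_table(text):
    """Encode text in every charset listed in CHARSETS."""
    return {label: encode_row(text, codec) for label, codec in CHARSETS.items()}


if __name__ == "__main__":
    for label, (bits, hex_digits) in conversion_table("絶^").items():
        print(label, bits, hex_digits)
```

Each row pairs the same bytes in two notations: every byte becomes eight binary digits or two hexadecimal digits, so the binary column is always four times as long as the hexadecimal one.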