To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN ??????矣?????溢i┃怨?????? 001111110011111100111111001111110011111100111111111000011110000100111111001111110011111100111111001111111000100011101100100000101000100110000100101010111000100110000101001111110011111100111111001111110011111100111111 3f3f3f3f3f3fe1e13f3f3f3f3f88ec828984ab89853f3f3f3f3f3f
EUC-JP ???堉??矣?????溢i┃怨?????? 0011111100111111001111111000111110110111111111010011111100111111111000101110001100111111001111110011111100111111001111111011000011101110101000111110100110101000101011011011000111100101001111110011111100111111001111110011111100111111 3f3f3f8fb7fd3f3fe2e33f3f3f3f3fb0eea3e9a8adb1e53f3f3f3f3f3f
UTF-8 嶺뚢돦堉싩춱矣꾧콞曆욁굦溢i┃怨룻뜎嶺뚢돧紐 111011111010011010101011111010111001101010100010111010111000111110100110111001011010000010001001111011001000101110101001111011001011011010110001111001111001111110100011111010101011111010100111111011001011110110011110111011111010011010001011111011001001101010000001111010101011010110100110111001101011101010100010111011111011110110001001111000101001010010000011111001101000000010101000111010111010001110111011111010111001110010001110111011111010011010101011111010111001101010100010111010111000111110100111111011111010011110001111 efa6abeb9aa2eb8fa6e5a089ec8ba9ecb6b1e79fa3eabea7ecbd9eefa68bec9a81eab5a6e6baa2efbd89e29483e680a8eba3bbeb9c8eefa6abeb9aa2eb8fa7efa78f
UHC 嶺뚢돦堉싩춱矣꾧콞曆욁굦溢i┃怨룻뜎嶺뚢돧紐 1110011110101101100011001110001010001001101010101110101110111100100110101110011110101101100011011110101111111000100001001110101010110001100101101110011010110111100111101110001110000010100011001110110011101110101000111110100110100110101011011110101010110011101101111110110110001101100100011110011110101101100011001110001010001001101010111110101110101010 e7ad8ce289aaebbc9ae7ad8debf884eab196e6b79ee3828ceceea3e9a6adeab3b7ed8d91e7ad8ce289abebaa

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)