To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????????????????B 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101000010 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f42
SJIS-WIN ??シ埃??吟??昻??獄??爾れ????B 0011111100111111100000110101011010011010101110100011111100111111100010111110000100111111001111111111101011010000001111110011111110001101100101100011111100111111100011101010001010000010111010100011111100111111001111110011111101000010 3f3f83569aba3f3f8be13f3ffad03f3f8d963f3f8ea282ea3f3f3f3f42
EUC-JP ??シ埃??吟?????獄??爾れ????B 00111111001111111010010110110111110101001011110000111111001111111011011011100011001111110011111100111111001111110011111110111001111101100011111100111111101111001010010010100100111011000011111100111111001111110011111101000010 3f3fa5b7d4bc3f3fb6e33f3f3f3f3fb9f63f3fbca4a4ec3f3f3f3f42
UTF-8 琉뗨シ埃덉나吟섅끏昻삳쨱獄룟몚爾れ깦栒욕떩B 11101111101001111000110011101011100101111010100011100011100000101011011111100101100111111000001111101011100011011000100111101011100000101001100011100101100100001001111111101100100001001000010111101011100000011000111111100110100110001011101111101100100000101011001111101100101010001011000111100111100011011000010011101011101000111001111111101011101010101001101011100111100010001011111011100011100000101000110011101010101110011010011011100110101000001001001011101100100110101001010111101011100101101010100101000010 efa78ceb97a8e382b7e59f83eb8d89eb8298e5909fec8485eb818fe698bbec82b3eca8b1e78d84eba39febaa9ae788bee3828ceab9a6e6a092ec9a95eb96a942
UHC 琉뗨シ埃덉나吟섅끏昻삳쨱獄룟몚爾れ깦栒욕떩B 11101011101001001000101111101000101010111011011111100100111011111000100011101100101100111010101011101011111000011001100011100011100001011011111111100100111010011011101111101011101001001000101111101000101010111011011111100101100100011000100011101100101100111010101011101100100000111001100011100010111000111011111111100101100010111011101101000010 eba48be8abb7e4ef88ecb3aaebe198e385bfe4e9bbeba48be8abb7e59188ecb3aaec8398e2e3bfe58bbb42

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)