To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????????????B 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101000010 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f42
SJIS-WIN ???誼よ?柔レ????誼よ?柔レ?B 001111110011111100111111100010110110001010000010111001100011111110001111010111111000001110001100001111110011111100111111001111111000101101100010100000101110011000111111100011110101111110000011100011000011111101000010 3f3f3f8b6282e63f8f5f838c3f3f3f3f8b6282e63f8f5f838c3f42
EUC-JP 孼??誼よ?柔レ?孼??誼よ?柔レ?B 10001111101110101100001100111111001111111011010111000011101001001110100000111111101111011100000010100101111011000011111110001111101110101100001100111111001111111011010111000011101001001110100000111111101111011100000010100101111011000011111101000010 8fbac33f3fb5c3a4e83fbdc0a5ec3f8fbac33f3fb5c3a4e83fbdc0a5ec3f42
UTF-8 孼띠룆誼よ몛柔レ젳孼띠룆誼よ몛柔レ젳B 11100101101011011011110011101011100111011010000011101011101000111000011011101000101010101011110011100011100000101000100011101011101010101001101111100110100111111001010011100011100000111010110011101100101000001011001111100101101011011011110011101011100111011010000011101011101000111000011011101000101010101011110011100011100000101000100011101011101010101001101111100110100111111001010011100011100000111010110011101100101000001011001101000010 e5adbceb9da0eba386e8aabce38288ebaa9be69f94e383aceca0b3e5adbceb9da0eba386e8aabce38288ebaa9be69f94e383aceca0b342
UHC 孼띠룆誼よ몛柔レ젳孼띠룆誼よ몛柔レ젳B 11100101111011011011011011101100100011111000010111101011111111101010101011101000100100011000100111101010111101011010101111101100101000001010011111100101111011011011011011101100100011111000010111101011111111101010101011101000100100011000100111101010111101011010101111101100101000001010011101000010 e5edb6ec8f85ebfeaae89189eaf5abeca0a7e5edb6ec8f85ebfeaae89189eaf5abeca0a742

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)