What bit string is a character (or a short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ??????????????????????^ 0011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN ?忽??籬忽矯??痢宏?忽??籬忽矯??痢槐^ 0011111110001101100110100011111100111111111000101101111110001101100110101000101110111000001111110011111110010111100111111000110101000111001111111000110110011010001111110011111111100010110111111000110110011010100010111011100000111111001111111001011110011111100111101100010101011110 3f8d9a3f3fe2df8d9a8bb83f3f979f8d473f8d9a3f3fe2df8d9a8bb83f3f979f9ec55e
EUC-JP ?忽熢?籬忽矯熢?痢宏?忽熢?籬忽矯熢?痢槐^ 00111111101110011111101010001111110010101010101100111111111001001110000110111001111110101011011010111010100011111100101010101011001111111100111010100001101110011010100000111111101110011111101010001111110010101010101100111111111001001110000110111001111110101011011010111010100011111100101010101011001111111100111010100001110111001100011101011110 3fb9fa8fcaab3fe4e1b9fab6ba8fcaab3fcea1b9a83fb9fa8fcaab3fe4e1b9fab6ba8fcaab3fcea1dcc75e
UTF-8 뤑忽熢첁籬忽矯熢첁痢宏뤑忽熢첁籬忽矯熢첁痢槐^ 11101011101001001001000111100101101111111011110111100111100001101010001011101100101100101000000111100111101100011010110011100101101111111011110111100111100111111010111111100111100001101010001011101100101100101000000111100111100101111010001011100101101011101000111111101011101001001001000111100101101111111011110111100111100001101010001011101100101100101000000111100111101100011010110011100101101111111011110111100111100111111010111111100111100001101010001011101100101100101000000111100111100101111010001011100110101001111001000001011110 eba491e5bfbde786a2ecb281e7b1ace5bfbde79fafe786a2ecb281e797a2e5ae8feba491e5bfbde786a2ecb281e7b1ace5bfbde79fafe786a2ecb281e797a2e6a7905e
UHC 뤑忽熢첁籬忽矯熢첁痢宏뤑忽熢첁籬忽矯熢첁痢槐^ 100011111100000111111011111011001101110011101100101010101000111011010111111001101111101111101100110011101110110011011100111011001010101010001110110101111110010111001110110110111000111111000001111110111110110011011100111011001010101010001110110101111110011011111011111011001100111011101100110111001110110010101010100011101101011111100101110011101101100101011110 8fc1fbecdcecaa8ed7e6fbecceecdcecaa8ed7e5cedb8fc1fbecdcecaa8ed7e6fbecceecdcecaa8ed7e5ced95e

SJIS-Win, EUC-JP: Legacy character sets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
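
The conversion shown in the table can be approximated offline. The following is a minimal Python sketch, not the code behind this page. It assumes that Python's built-in codecs `cp932`, `euc_jp`, and `cp949` are close enough to SJIS-Win, EUC-JP, and UHC for illustration (edge cases may differ), and that characters a charset cannot represent are replaced by "?" (0x3f), as the ISO-8859-1 row suggests. The names `CHARSETS`, `encode_row`, and `encode_table` are made up for this example.

```python
# Hypothetical mapping from the table's charset labels to Python codec names.
# Assumption: these codecs approximate the charsets used by the page.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-Win": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Return (binary string, hex string) for text encoded with codec.

    Characters the codec cannot represent are replaced by "?" (0x3f).
    """
    data = text.encode(codec, errors="replace")
    binary = "".join(f"{byte:08b}" for byte in data)  # 8 bits per byte
    return binary, data.hex()


def encode_table(text):
    """Encode text in every charset and return {label: (binary, hex)}."""
    return {label: encode_row(text, codec) for label, codec in CHARSETS.items()}


if __name__ == "__main__":
    # Print one row per charset: label, input, bit string, hex string.
    for label, (binary, hexadecimal) in encode_table("あ^").items():
        print(label, "あ^", binary, hexadecimal)
```

With this sketch, "^" is expected to encode to the single byte 5e in every charset listed, which matches the last byte of each row in the table above, while a character such as "あ" yields different byte sequences per charset (or 3f where it is not representable).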