To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 營??乳?.宥??嵬??愉ユ????暗 100110100111101000111111001111111001001111111011001111111000000101000100100101110100011100111111001111111001101111001010001111110011111110010110111110011000001110000110001111110011111100111111001111111000100011000011 9a7a3f3f93fb3f814497473f3f9bca3f3f96f983863f3f3f3f88c3
EUC-JP 營??乳?.宥??嵬??愉ユ?洧??暗 1101001111011011001111110011111111000110111111010011111110100001101001011100110110101000001111110011111111010110110011000011111100111111110011001111101110100101111001100011111110001111110001111011010000111111001111111011000011000101 d3db3f3fc6fd3fa1a5cda83f3fd6cc3f3fccfba5e63f8fc7b43f3fb0c5
UTF-8 營뚯궠乳꿴.宥몃츇嵬됯램愉ユ갭洧뺣뉼暗 111001111000011110011111111010111001101010101111111010101011011010100000111001001011100110110011111010101011111110110100111011111011110010001110111001011010111010100101111010111010101010000011111011001011100010000111111001011011010110101100111010111001000010101111111010111001111010101000111001101000010010001001111000111000001110100110111010101011000010101101111001101011010010100111111010111011101010100011111010111000100110111100111001101001101010010111 e7879feb9aafeab6a0e4b9b3eabfb4efbc8ee5aea5ebaa83ecb887e5b5aceb90afeb9ea8e68489e383a6eab0ade6b4a7ebbaa3eb89bce69a97
UHC 營뚯궠乳꿴.宥몃츇嵬됯램愉ユ갭洧뺣뉼暗 1110011110111101100011001110110010000010101100111110101011100001101100101110100110100011101011101110101011101001101110001110101110101110100001001110100011100011100010011110101010110111101001011110101011110000101010111110011010110000101110001110101011111011100101011110101110110100101111001110010011011110 e7bd8cec82b3eae1b2e9a3aeeae9b8ebae84e8e389eab7a5eaf0abe6b0b8eafb95ebb4bce4de

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)