To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????????????????? 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN ???援??儒??繹??揖х?淞る?壤?? 0011111100111111001111111000100110000111001111110011111110001110111100100011111100111111111000111000100000111111001111111001011101001011100001001000011100111111100111111100001010000010111010010011111110011010110111110011111100111111 3f3f3f89873f3f8ef23f3fe3883f3f974b84873f9fc282e93f9adf3f3f
EUC-JP ???援??儒??繹??揖х?淞る?壤?? 0011111100111111001111111011000111100111001111110011111110111100111101000011111100111111111001011110100000111111001111111100110110101100101001111110011100111111110111101100010010100100111010110011111111010100111000010011111100111111 3f3f3fb1e73f3fbcf43f3fe5e83f3fcdaca7e73fdec4a4eb3fd4e13f3f
UTF-8 嶺뚮뱪援㎪룚儒붽콞繹먮냱揖х춯淞る윪壤쏆칳 1110111110100110101010111110101110011010101011101110101110110001101010101110011010001111101101001110001110001110101010101110101110100011100110101110010110000100100100101110101110110110101111011110110010111101100111101110011110111001101110011110101110101000101011101110101110000011101100011110011010001111100101101101000110000101111011001011011010101111111001101011011110011110111000111000001010001011111011001001110010101010111001011010001110100100111011001000111110000110111011001011100110110011 efa6abeb9aaeebb1aae68fb4e38eaaeba39ae58492ebb6bdecbd9ee7b9b9eba8aeeb83b1e68f96d185ecb6afe6b79ee3828bec9caae5a3a4ec8f86ecb9b3
UHC 嶺뚮뱪援㎪룚儒붽콞繹먮냱揖х춯淞る윪壤쏆칳 111001111010110110001100111010111001001110010000111010101011010110100111111001101000111110010110111010101110001110010100111010101011000110010110111001101011101010010000111010111000011010000001111010111110011110101100111001111010110110001100111000011110011110101010111010111001111110101001111001011011110110011011111011001010111110000110 e7ad8ceb9390eab5a7e68f96eae394eab196e6ba90eb8681ebe7ace7ad8ce1e7aaeb9fa9e5bd9becaf86

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)