What bit string is a character (or short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ?????????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 伍??誼よ?節??櫻?????怨???l?誼 100011001101111000111111001111111000101101100010100000101110011000111111100100001101111100111111001111111001111101001110001111110011111100111111001111110011111110001001100001010011111100111111001111111000001010001100001111111000101101100010 8cde3f3f8b6282e63f90df3f3f9f4e3f3f3f3f3f89853f3f3f828c3f8b62
EUC-JP 伍??誼よ?節??櫻?????怨??渶l?誼 1011100011100000001111110011111110110101110000111010010011101000001111111100000011100001001111110011111111011101101011110011111100111111001111110011111100111111101100011110010100111111001111111000111111000111111011011010001111101100001111111011010111000011 b8e03f3fb5c3a4e83fc0e13f3fddaf3f3f3f3f3fb1e53f3f8fc7eda3ec3fb5c3
UTF-8 伍밸맧誼よ돳節뗭젩櫻뗭쥋李롳쭓怨뺤젃渶l쉶誼 111001001011110010001101111010111011000010111000111010111010011110100111111010001010101010111100111000111000001010001000111010111000111110110011111001111010111110000000111010111001011110101101111011001010000010101001111001101010101110111011111010111001011110101101111011001010010110001011111011111010011110100001111010111010000110110011111011001010110110010011111001101000000010101000111010111011101010100100111011001010000010000011111001101011100010110110111011111011110110001100111011001000100110110110111010001010101010111100 e4bc8debb0b8eba7a7e8aabce38288eb8fb3e7af80eb97adeca0a9e6abbbeb97adeca58befa7a1eba1b3ecad93e680a8ebbaa4eca083e6b8b6efbd8cec89b6e8aabc
UHC 伍밸맧誼よ돳節뗭젩櫻뗭쥋李롳쭓怨뺤젃渶l쉶誼 1110011111101010101110011110101110010000101100001110101111111110101010101110100010001001101101101110111110111101100010111110110010100000101000011110010110100001100010111110110010100010100001001110110010110000100011101110111110100111100010111110101010110011100101011110110010100000100001111110011110110111101000111110110010011010100011001110101111111110 e7eab9eb90b0ebfeaae889b6efbd8beca0a1e5a18beca284ecb08eefa78beab395eca087e7b7a3ec9a8cebfe

SJIS-Win, EUC-JP: Legacy character sets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
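
The conversion shown in the table can be approximated offline. The following is a minimal Python sketch, not the code behind this page: the function name `encode_table` is hypothetical, and it assumes that Python's `cp932` and `cp949` codecs are close enough to the SJIS-Win and UHC charsets listed above (a few mappings may differ between implementations). Characters a charset cannot represent are replaced with "?" (byte 3f), which is consistent with the runs of 3f in the table.

```python
# Sketch: encode a string in several charsets and show the bytes
# as a binary bit string and as hexadecimal.
# Assumption: these Python codec names stand in for the page's charsets
# (cp932 for SJIS-Win, cp949 for UHC).
CHARSETS = ("iso-8859-1", "cp932", "euc_jp", "utf-8", "cp949")


def encode_table(text, charsets=CHARSETS):
    """Return one (charset, round-tripped text, bits, hex) tuple per charset."""
    rows = []
    for charset in charsets:
        # errors="replace" substitutes "?" for characters the charset lacks.
        data = text.encode(charset, errors="replace")
        bits = "".join(f"{byte:08b}" for byte in data)  # 8 bits per byte
        rows.append((charset, data.decode(charset), bits, data.hex()))
    return rows


if __name__ == "__main__":
    for charset, shown, bits, hexstr in encode_table("あ"):
        print(charset, shown, bits, hexstr)
```

Decoding the encoded bytes again with the same charset, as done for the second column, shows which characters survived the conversion and which were replaced.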