To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????????????????? 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 筌??鎰??銀ш??レ?誼??魏???ル? 1110001010100011001111110011111111101000010011000011111100111111100010111110001010000100100010100011111100111111100000111000110000111111100010110110001000111111001111111110100110110000001111110011111100111111100000111000101100111111 e2a33f3fe84c3f3f8be2848a3f3f838c3f8b623f3fe9b03f3f3f838b3f
EUC-JP 筌??鎰??銀ш??レ?誼??魏???ル? 1110010010100101001111110011111111101111101011010011111100111111101101101110010010100111111010100011111100111111101001011110110000111111101101011100001100111111001111111111001010110010001111110011111100111111101001011110101100111111 e4a53f3fefad3f3fb6e4a7ea3f3fa5ec3fb5c33f3ff2b23f3f3fa5eb3f
UTF-8 筌뗪릿鎰싨에銀ш쉽曆レ뮆誼욘에魏됥렍曆ル쪢 1110011110101101100011001110101110010111101010101110101110100110101111111110100110001110101100001110110010001011101010001110110010010111100100001110100110001010100000001101000110001000111011001000100110111101111011111010011010001011111000111000001110101100111010111010111010000110111010001010101010111100111011001001101010011000111011001001011110010000111010011010110110001111111010111001000010100101111010111010000010001101111011111010011010001011111000111000001110101011111011001010101010100010 e7ad8ceb97aaeba6bfe98eb0ec8ba8ec9790e98a80d188ec89bdefa68be383acebae86e8aabcec9a98ec9790e9ad8feb90a5eba08defa68be383abecaaa2
UHC 筌뗪릿鎰싨에銀ш쉽曆レ뮆誼욘에魏됥렍曆ル쪢 111011111010011110001011111010101011100010110100111011001111000010011010111001101011111110100001111010111101111010101100111010101011110110110001111001101011011110101011111011001001001010010101111010111111111010111111111001101011111110100001111010101110000010001001111000111000111010100011111001101011011110101011111010111010010110011011 efa78beab8b4ecf09ae6bfa1ebdeaceabdb1e6b7abec9295ebfebfe6bfa1eae089e38ea3e6b7abeba59b

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)