To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 蒻れ?油???щ┛?????Ⅴ貫??ぜ 111001001110100010000010111010100011111110010110111110110011111100111111001111111000010010001011100001001010111000111111001111110011111100111111001111111000011101011000100010101101000100111111001111111000001010111010 e4e882ea3f96fb3f3f3f848b84ae3f3f3f3f3f87588ad13f3f82ba
EUC-JP 蒻れ?油???щ┛??????貫靷?ぜ 11101000111010101010010011101100001111111100110011111101001111110011111100111111101001111110101110101000101100000011111100111111001111110011111100111111001111111011010011010011100011111110011110111101001111111010010010111100 e8eaa4ec3fccfd3f3f3fa7eba8b03f3f3f3f3f3fb4d38fe7bd3fa4bc
UTF-8 蒻れ슜油꾣끽戮щ┛廬믩똾留㏆Ⅴ貫靷숃ぜ 1110100010010010101110111110001110000010100011001110110010001010100111001110011010110010101110011110101010111110101000111110101110000001101111011110111110100111100100101101000110001001111000101001010010011011111011111010011010000010111010111010111110101001111010111001100010111110111011111010011110001101111000111000111110000110111000101000010110100100111010001011001010101011111010011001110110110111111011001000100010000011111000111000000110011100 e892bbe3828cec8a9ce6b2b9eabea3eb81bdefa792d189e2949befa682ebafa9eb98beefa78de38f86e285a4e8b2abe99db7ec8883e3819c
UHC 蒻れ슜油꾣끽戮щ┛廬믩똾留㏆Ⅴ貫靷숃ぜ 1110010110110110101010101110110010011010101010011110101011111010100001001110011010110011101000111110101110111101101011001110101110100110101100001110010111111110100100101110101110001100100001001110101110100111101001111110111110100101101101001100111010111011111011001110011010011001111010001010101010111100 e5b6aaec9aa9eafa84e6b3a3ebbdaceba6b0e5fe92eb8c84eba7a7efa5b4cebbece699e8aabc

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)