To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f
SJIS-WIN ?る?意??揄ъ?燁 001111111000001011101001001111111000100011010011001111110011111110011101100010011000010010001100001111111111101101011001 3f82e93f88d33f3f9d89848c3ffb59
EUC-JP ?る?意??揄ъŊ燁 001111111010010011101011001111111011000011010101001111110011111111011001111010011010011111101100100011111010100110101011100011111100101010110011 3fa4eb3fb0d53f3fd9e9a7ec8fa9ab8fcab3
UTF-8 閭る틷意㎩젽揄ъŊ燁 11101111101001101000011011100011100000101000101111101101100010111011011111100110100001001000111111100011100011101010100111101100101000001011110111100110100011111000010011010001100010101100010110001010111001111000011110000001 efa686e3828bed8bb7e6848fe38ea9eca0bde68f84d18ac58ae78781
UHC 閭る틷意㎩젽揄ъŊ燁 1110011010101101101010101110101110111010100111101110101111110010101001111110010110100000101011111110101011110001101011001110110010101000101011111110011110100111 e6adaaebba9eebf2a7e5a0afeaf1aceca8afe7a7

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)