To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 邀??逸??濡ル?筌??義??怨??雅 111001111011000100111111001111111000100011101101001111110011111110010100010001111000001110001011001111111110001010100011001111110011111110001011011000000011111100111111100010011000010100111111001111111000100111101011 e7b13f3f88ed3f3f9447838b3fe2a33f3f8b603f3f89853f3f89eb
EUC-JP 邀??逸??濡ル?筌??義??怨??雅 111011101011001100111111001111111011000011101111001111110011111111000111101010001010010111101011001111111110010010100101001111110011111110110101110000010011111100111111101100011110010100111111001111111011001011101101 eeb33f3fb0ef3f3fc7a8a5eb3fe4a53f3fb5c13f3fb1e53f3fb2ed
UTF-8 邀싲슣逸뤄쭓濡ル뼬筌뗭쥙義억쭓怨뺤젡雅 111010011000001010000000111011001000101110110010111011001000101010100011111010011000000010111000111010111010010010000100111011001010110110010011111001101011111110100001111000111000001110101011111010111011110010101100111001111010110110001100111010111001011110101101111011001010010110011001111001111011111010101001111011001001011010110101111011001010110110010011111001101000000010101000111010111011101010100100111011001010000010100001111010011001101110000101 e98280ec8bb2ec8aa3e980b8eba484ecad93e6bfa1e383abebbcace7ad8ceb97adeca599e7bea9ec96b5ecad93e680a8ebbaa4eca0a1e99b85
UHC 邀싲슣逸뤄쭓濡ル뼬筌뗭쥙義억쭓怨뺤젡雅 1110100110101101100110101110101110011010101011111110110011101111101101111110111110100111100010111110101110100001101010111110101110010110101011111110111110100111100010111110110010100010100011101110101111111001101111101110111110100111100010111110101010110011100101011110110010100000100110101110010010111010 e9ad9aeb9aafecefb7efa78beba1abeb96afefa78beca28eebf9beefa78beab395eca09ae4ba

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)