What bit string is a character (or a sequence of characters) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ????????????????????? 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 蘊??縊d?癲?=??????????ょお 11100101010111010011111100111111111000110110111110000010100001000011111111100001100111110011111110000001100000010011111100111111001111110011111100111111001111110011111100111111001111110011111110000010111001011000001010101000 e55d3f3fe36f82843fe19f3f81813f3f3f3f3f3f3f3f3f3f82e582a8
EUC-JP 蘊??縊d?癲?=??????????ょお 11101001101111100011111100111111111001011101000010100011111001000011111111100010101000010011111110100001111000010011111100111111001111110011111100111111001111110011111100111111001111110011111110100100111001111010010010101010 e9be3f3fe5d0a3e43fe2a13fa1e13f3f3f3f3f3f3f3f3f3fa4e7a4aa
UTF-8 蘊딅젨縊d틮癲뚩=溜⑵텤料뚮젶溜쀤슭溜ょお 111010001001100010001010111010111001010010000101111011001010000010101000111001111011100010001010111011111011110110000100111011011000101110101110111001111001100110110010111010111001101010101001111011111011110010011101111011111010011110001011111000101001000110110101111011011000010110100100111011111010011010111110111010111001101010101110111011001010000010110110111011111010011110001011111011001000000010100100111011001000101010101101111011111010011110001011111000111000001010000111111000111000000110001010 e8988aeb9485eca0a8e7b88aefbd84ed8baee799b2eb9aa9efbc9defa78be291b5ed85a4efa6beeb9aaeeca0b6efa78bec80a4ec8aadefa78be38287e3818a
UHC 蘊딅젨縊d틮癲뚩=溜⑵텤料뚮젶溜쀤슭溜ょお 111010001011001110001010111010111010000010100000111001001111110010100011111001001011101010011000111011111010011010001100111010001010001110111101111010101111111010101001111010001011011010011001111010001111011110001100111010111010000010101010111010101111111010010111111001001011110110111110111010101111111010101010111001111010101010101010 e8b38aeba0a0e4fca3e4ba98efa68ce8a3bdeafea9e8b699e8f78ceba0aaeafe97e4bdbeeafeaae7aaaa

SJIS-Win, EUC-JP: Classic character sets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
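
The conversion shown in the table can be approximated offline. The following is a minimal sketch in Python, not the code behind this page: the codec names in `CHARSETS` (for example `cp932` for SJIS-WIN and `cp949` for UHC) are assumptions about which Python codecs correspond to the charsets listed, and `encode_row` and `encode_table` are hypothetical helper names. Characters that a charset cannot represent are replaced with `?` (byte 3f), which matches the runs of 3f in the table.

```python
# Assumed mapping from the table's charset labels to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Encode text with the given codec and return (binary string, hex string)."""
    # errors="replace" substitutes "?" for characters the codec cannot encode.
    raw = text.encode(codec, errors="replace")
    bits = "".join(f"{byte:08b}" for byte in raw)  # 8 bits per byte, zero-padded
    return bits, raw.hex()


def encode_table(text):
    """Build one (binary, hex) pair per charset label."""
    return {label: encode_row(text, codec) for label, codec in CHARSETS.items()}


if __name__ == "__main__":
    for label, (bits, hex_string) in encode_table("ょお").items():
        print(label, bits, hex_string)
```

For the input "ょお", this yields 82e582a8 for SJIS-WIN, a4e7a4aa for EUC-JP, and e38287e3818a for UTF-8, which agree with the trailing bytes of the corresponding rows in the table.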