To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 烏??誼??怨??筌??誼??怨???ш?愉 100010010100011100111111001111111000101101100010001111110011111110001001100001010011111100111111111000101010001100111111001111111000101101100010001111110011111110001001100001010011111100111111001111111000010010001010001111111001011011111001 89473f3f8b623f3f89853f3fe2a33f3f8b623f3f89853f3f3f848a3f96f9
EUC-JP 烏??誼??怨??筌??誼??怨???ш?愉 101100011010100000111111001111111011010111000011001111110011111110110001111001010011111100111111111001001010010100111111001111111011010111000011001111110011111110110001111001010011111100111111001111111010011111101010001111111100110011111011 b1a83f3fb5c33f3fb1e53f3fe4a53f3fb5c33f3fb1e53f3f3fa7ea3fccfb
UTF-8 烏띾슢誼뤄쬃怨쀬뺍筌뗪퀡誼썹넭怨룹젴輦ш낯愉 1110011110000011100011111110101110011101101111101110110010001010101000101110100010101010101111001110101110100100100001001110110010101100100000111110011010000000101010001110110010000000101011001110101110111010100011011110011110101101100011001110101110010111101010101110110110000000101000011110100010101010101111001110110010001101101110011110101110000100101011011110011010000000101010001110101110100011101110011110110010100000101101001110111110100110100110001101000110001000111010111000001010101111111001101000010010001001 e7838feb9dbeec8aa2e8aabceba484ecac83e680a8ec80acebba8de7ad8ceb97aaed80a1e8aabcec8db9eb84ade680a8eba3b9eca0b4efa698d188eb82afe68489
UHC 烏띾슢誼뤄쬃怨쀬뺍筌뗪퀡誼썹넭怨룹젴輦ш낯愉 1110100010100001100011011110101110011010101011101110101111111110101101111110111110100110100110101110101010110011100101111110110010111011101011101110111110100111100010111110101010110011100101011110101111111110101111011110011110000110101011001110101010110011101101111110110010100000101010001110011011100100101011001110101010110011101110001110101011110000 e8a18deb9aaeebfeb7efa69aeab397ecbbaeefa78beab395ebfebde786aceab3b7eca0a8e6e4aceab3b8eaf0

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)