To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ???????? 0011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f
SJIS-WIN 證殊サ溢き諱ゎエ 1110011010011010100011101110101010111011100010001110110010000010101010111110011010000001100000101110110010110100 e69a8eeabb88ec82abe68182ecb4
EUC-JP 證殊サ溢き諱ゎエ 11101011111110101011110011101100100011101011101110110000111011101010010010101101111010111110000110100100111011101000111010110100 ebfabcec8ebbb0eea4adebe1a4ee8eb4
UTF-8 證殊サ溢き諱ゎエ 111010001010110110001001111001101010111010001010111011111011110110111011111001101011101010100010111000111000000110001101111010001010101110110001111000111000001010001110111011111011110110110100 e8ad89e6ae8aefbdbbe6baa2e3818de8abb1e3828eefbdb4
UHC 證殊?溢き諱ゎ? 1111000111111011111000101010100000111111111011001110111010101010101011011111110111001001101010101110111000111111 f1fbe2a83feceeaaadfdc9aaee3f

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)