To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????v????vB 0011111100111111001111110011111101110110001111110011111100111111001111110111011001000010 3f3f3f3f763f3f3f3f7642
SJIS-WIN ????v????vB 0011111100111111001111110011111101110110001111110011111100111111001111110111011001000010 3f3f3f3f763f3f3f3f7642
EUC-JP ????v????vB 0011111100111111001111110011111101110110001111110011111100111111001111110111011001000010 3f3f3f3f763f3f3f3f7642
UTF-8 션샷셸셧v션샷셸셧vB 111011001000010110011000111011001000001110110111111011001000010110111000111011001000010110100111011101101110110010000101100110001110110010000011101101111110110010000101101110001110110010000101101001110111011001000010 ec8598ec83b7ec85b8ec85a776ec8598ec83b7ec85b8ec85a77642
UHC 션샷셸셧v션샷셸셧vB 10111100110001111011110010100110101111001101000010111100110010110111011010111100110001111011110010100110101111001101000010111100110010110111011001000010 bcc7bca6bcd0bccb76bcc7bca6bcd0bccb7642

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)