Which bit string is a character (or short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ???F}v???F}vB 00111111001111110011111101000110011111010111011000111111001111110011111101000110011111010111011001000010 3f3f3f467d763f3f3f467d7642
SJIS-WIN 汚手?F}v汚手?F}vB 1000100110011000100011101110100000111111010001100111110101110110100010011001100010001110111010000011111101000110011111010111011001000010 89988ee83f467d7689988ee83f467d7642
EUC-JP 汚手?F}v汚手?F}vB 1011000111111000101111001110101000111111010001100111110101110110101100011111100010111100111010100011111101000110011111010111011001000010 b1f8bcea3f467d76b1f8bcea3f467d7642
UTF-8 汚手쐝F}v汚手쐝F}vB 11100110101100011001101011100110100010011000101111101100100100001001110101000110011111010111011011100110101100011001101011100110100010011000101111101100100100001001110101000110011111010111011001000010 e6b19ae6898bec909d467d76e6b19ae6898bec909d467d7642
UHC 汚手쐝F}v汚手쐝F}vB 11100111111111011110001010100010100111001000001101000110011111010111011011100111111111011110001010100010100111001000001101000110011111010111011001000010 e7fde2a29c83467d76e7fde2a29c83467d7642

SJIS-Win, EUC-JP: Legacy charsets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
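
The rows above can be approximated offline with a short Python sketch. This is not the converter's own implementation. The codec names (`cp932` for SJIS-WIN, `cp949` for UHC, and so on) are assumed equivalents from Python's standard codecs, and the helper names `encode_row` and `encode_table` are hypothetical. Characters that a charset cannot represent are replaced with "?" (0x3f), which matches what the table shows for ISO-8859-1, SJIS-WIN, and EUC-JP.

```python
# Assumed mapping from the table's charset labels to Python codec names.
# The converter itself may use a different library with different tables.
CODECS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Return (displayed text, binary bit string, hex string) for one charset."""
    # errors="replace" substitutes "?" for characters the charset lacks.
    raw = text.encode(codec, errors="replace")
    shown = raw.decode(codec)                    # what survives the round trip
    bits = "".join(f"{b:08b}" for b in raw)      # 8 bits per byte, zero padded
    return shown, bits, raw.hex()


def encode_table(text):
    """Build one row per charset, in the same column order as the table."""
    return [(label, *encode_row(text, codec)) for label, codec in CODECS.items()]


if __name__ == "__main__":
    for label, shown, bits, hexed in encode_table("汚手쐝F}v"):
        print(label, shown, bits, hexed)
```

Calling `encode_table` with half of the sample input, as in the `__main__` block, prints one line per charset in the same column order as the table.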