To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????}v????????}vB 001111110011111100111111001111110011111100111111001111110011111101111101011101100011111100111111001111110011111100111111001111110011111100111111011111010111011001000010 3f3f3f3f3f3f3f3f7d763f3f3f3f3f3f3f3f7d7642
SJIS-WIN 髣厄スー陷奇スア}v髣厄スー陷奇スア}vB 1110100110010111100101101110111110111101101100001110100010011100100010101110111110111101101100010111110101110110111010011001011110010110111011111011110110110000111010001001110010001010111011111011110110110001011111010111011001000010 e99796efbdb0e89c8aefbdb17d76e99796efbdb0e89c8aefbdb17d7642
EUC-JP 髣厄スー陷奇スア}v髣厄スー陷奇スア}vB 11110001111101111100110011110001100011101011110110001110101100001110111111111100101101001111000110001110101111011000111010110001011111010111011011110001111101111100110011110001100011101011110110001110101100001110111111111100101101001111000110001110101111011000111010110001011111010111011001000010 f1f7ccf18ebd8eb0effcb4f18ebd8eb17d76f1f7ccf18ebd8eb0effcb4f18ebd8eb17d7642
UTF-8 髣厄スー陷奇スア}v髣厄スー陷奇スア}vB 1110100110101011101000111110010110001110100001001110111110111101101111011110111110111101101100001110100110011001101101111110010110100101100001111110111110111101101111011110111110111101101100010111110101110110111010011010101110100011111001011000111010000100111011111011110110111101111011111011110110110000111010011001100110110111111001011010010110000111111011111011110110111101111011111011110110110001011111010111011001000010 e9aba3e58e84efbdbdefbdb0e999b7e5a587efbdbdefbdb17d76e9aba3e58e84efbdbdefbdb0e999b7e5a587efbdbdefbdb17d7642
UHC ?厄??陷奇??}v?厄??陷奇??}vB 001111111110010011111000001111110011111111111001111010001101000011110100001111110011111101111101011101100011111111100100111110000011111100111111111110011110100011010000111101000011111100111111011111010111011001000010 3fe4f83f3ff9e8d0f43f3f7d763fe4f83f3ff9e8d0f43f3f7d7642

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)