What bit string is a character (or short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ????????}v????????}vB 001111110011111100111111001111110011111100111111001111110011111101111101011101100011111100111111001111110011111100111111001111110011111100111111011111010111011001000010 3f3f3f3f3f3f3f3f7d763f3f3f3f3f3f3f3f7d7642
SJIS-WIN 矮?第亘??鬱?}v矮?第亘??鬱?}vB 1110000111100010001111111001000111100110100110000110101000111111001111111001111101010100001111110111110101110110111000011110001000111111100100011110011010011000011010100011111100111111100111110101010000111111011111010111011001000010 e1e23f91e6986a3f3f9f543f7d76e1e23f91e6986a3f3f9f543f7d7642
EUC-JP 矮?第亘??鬱?}v矮?第亘??鬱?}vB 1110001011100100001111111100001011101000110011111100101100111111001111111101110110110101001111110111110101110110111000101110010000111111110000101110100011001111110010110011111100111111110111011011010100111111011111010111011001000010 e2e43fc2e8cfcb3f3fddb53f7d76e2e43fc2e8cfcb3f3fddb53f7d7642
UTF-8 矮렡第亘롛렡鬱렠}v矮렡第亘롛렡鬱렠}vB 1110011110011111101011101110101110100000101000011110011110101100101011001110010010111010100110001110101110100001100110111110101110100000101000011110100110101100101100011110101110100000101000000111110101110110111001111001111110101110111010111010000010100001111001111010110010101100111001001011101010011000111010111010000110011011111010111010000010100001111010011010110010110001111010111010000010100000011111010111011001000010 e79faeeba0a1e7acace4ba98eba19beba0a1e9acb1eba0a07d76e79faeeba0a1e7acace4ba98eba19beba0a1e9acb1eba0a07d7642
UHC 矮렡第亘롛렡鬱렠}v矮렡第亘롛렡鬱렠}vB 11101000111000011000111010110010111100001010111111010000111001101000111011011111100011101011001011101010101001101000111010110001011111010111011011101000111000011000111010110010111100001010111111010000111001101000111011011111100011101011001011101010101001101000111010110001011111010111011001000010 e8e18eb2f0afd0e68edf8eb2eaa68eb17d76e8e18eb2f0afd0e68edf8eb2eaa68eb17d7642

SJIS-Win, EUC-JP: Legacy charsets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
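
The conversion in the table above can be approximated with a minimal Python sketch. The converter's own implementation is not shown on this page, so the mapping from the charset labels to Python codec names (CODECS) and the helper names encode_row and encode_table are assumptions made for illustration. Characters a charset cannot represent are replaced with "?" (0x3f), which matches the 3f bytes in the table.

```python
# Assumed mapping from the table's charset labels to Python codec names.
# This is an illustration, not the converter's actual configuration.
CODECS = {
    "ISO-8859-1": "latin-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Return (binary string, hex string) for text encoded with codec."""
    # errors="replace" turns each unencodable character into "?" (0x3f).
    data = text.encode(codec, errors="replace")
    bits = "".join(f"{byte:08b}" for byte in data)
    return bits, data.hex()


def encode_table(text):
    """Encode text in every charset listed in CODECS."""
    return {label: encode_row(text, codec) for label, codec in CODECS.items()}


if __name__ == "__main__":
    # Print one row per charset: label, bit string, hex string.
    for label, (bits, hex_string) in encode_table("}vB").items():
        print(label, bits, hex_string)
```

For example, encode_row("第", "cp932") yields the hex string 91e6 and encode_row("第", "euc_jp") yields c2e8, the same byte pairs that appear for that character in the SJIS-WIN and EUC-JP rows.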