To what bit string is a character (or short string) encoded in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ?????????U}?????????U{^ 0011111100111111001111110011111100111111001111110011111100111111001111110101010101111101001111110011111100111111001111110011111100111111001111110011111100111111010101010111101101011110 3f3f3f3f3f3f3f3f3f557d3f3f3f3f3f3f3f3f3f557b5e
SJIS-WIN 鉛??嗚??悅??U}鉛??嗚??悅??U{^ 1000100110010100001111110011111110011010011010100011111100111111111110101011110100111111001111110101010101111101100010011001010000111111001111111001101001101010001111110011111111111010101111010011111100111111010101010111101101011110 89943f3f9a6a3f3ffabd3f3f557d89943f3f9a6a3f3ffabd3f3f557b5e
EUC-JP 鉛??嗚?????U}鉛??嗚?????U{^ 101100011111010000111111001111111101001111001011001111110011111100111111001111110011111101010101011111011011000111110100001111110011111111010011110010110011111100111111001111110011111100111111010101010111101101011110 b1f43f3fd3cb3f3f3f3f3f557db1f43f3fd3cb3f3f3f3f3f557b5e
UTF-8 鉛당뜌嗚잏땰悅롧뙻U}鉛당뜌嗚잏땰悅롧뙻U{^ 1110100110001001100110111110101110001011101110011110101110011100100011001110010110010111100110101110110010011110100011111110101110010101101100001110011010000010100001011110101110100001101001111110101110011001101110110101010101111101111010011000100110011011111010111000101110111001111010111001110010001100111001011001011110011010111011001001111010001111111010111001010110110000111001101000001010000101111010111010000110100111111010111001100110111011010101010111101101011110 e9899beb8bb9eb9c8ce5979aec9e8feb95b0e68285eba1a7eb99bb557de9899beb8bb9eb9c8ce5979aec9e8feb95b0e68285eba1a7eb99bb557b5e
UHC 鉛당뜌嗚잏땰悅롧뙻U}鉛당뜌嗚잏땰悅롧뙻U{^ 1110011011100111101101001110011110001101100011111110011111110000100111111110011110001011100001101110011011101101100011101110011110001100101111100101010101111101111001101110011110110100111001111000110110001111111001111111000010011111111001111000101110000110111001101110110110001110111001111000110010111110010101010111101101011110 e6e7b4e78d8fe7f09fe78b86e6ed8ee78cbe557de6e7b4e78d8fe7f09fe78b86e6ed8ee78cbe557b5e

SJIS-Win, EUC-JP: Classic character sets used mainly to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
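
The conversion shown in the table can be approximated in a few lines of Python. This is only a sketch, not the code behind this page: the helper name encode_to_bits and the codec names (cp932 for SJIS-Win, cp949 for UHC) are assumptions chosen to match Python's standard codecs, and errors="replace" is assumed to mirror the "?" (0x3f) substitution seen above for characters a charset cannot represent.

```python
# Hypothetical helper: encode text in a charset and return its bytes
# as a binary bit string and as a hexadecimal string.
def encode_to_bits(text, charset):
    # Unencodable characters become "?" (0x3f), as in the table above.
    data = text.encode(charset, errors="replace")
    bits = "".join(format(byte, "08b") for byte in data)
    return bits, data.hex()


# Assumed mapping from the charset labels on this page to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}

if __name__ == "__main__":
    sample = "鉛U{^"
    for label, codec in CHARSETS.items():
        bits, hexadecimal = encode_to_bits(sample, codec)
        print(label, bits, hexadecimal)
```

The ASCII characters U, {, and ^ encode to the same bytes (55 7b 5e) in every charset listed, which is why each row of the table ends with the same hexadecimal digits. Results for other characters may differ from this page where Python's codec tables and the page's converter disagree.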