To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????O[?????????O[^ 0011111100111111001111110011111100111111001111110011111100111111001111110100111101011011001111110011111100111111001111110011111100111111001111110011111100111111010011110101101101011110 3f3f3f3f3f3f3f3f3f4f5b3f3f3f3f3f3f3f3f3f4f5b5e
SJIS-WIN 涅??彦??汝??O[涅??彦??汝??O[^ 1001111110111000001111110011111110010101010001100011111100111111100100111111000000111111001111110100111101011011100111111011100000111111001111111001010101000110001111110011111110010011111100000011111100111111010011110101101101011110 9fb83f3f95463f3f93f03f3f4f5b9fb83f3f95463f3f93f03f3f4f5b5e
EUC-JP 涅??彦??汝??O[涅??彦??汝??O[^ 1101111010111010001111110011111111001001101001110011111100111111110001101111001000111111001111110100111101011011110111101011101000111111001111111100100110100111001111110011111111000110111100100011111100111111010011110101101101011110 deba3f3fc9a73f3fc6f23f3f4f5bdeba3f3fc9a73f3fc6f23f3f4f5b5e
UTF-8 涅듭옱彦붹쐿汝끺퐰O[涅듭옱彦붹쐿汝끺퐰O[^ 1110011010110110100001011110101110010011101011011110110010011000101100011110010110111101101001101110101110110110101110011110110010010000101111111110011010110001100111011110101110000001101110101110110110010000101100000100111101011011111001101011011010000101111010111001001110101101111011001001100010110001111001011011110110100110111010111011011010111001111011001001000010111111111001101011000110011101111010111000000110111010111011011001000010110000010011110101101101011110 e6b685eb93adec98b1e5bda6ebb6b9ec90bfe6b19deb81baed90b04f5be6b685eb93adec98b1e5bda6ebb6b9ec90bfe6b19deb81baed90b04f5b5e
UHC 涅듭옱彦붹쐿汝끺퐰O[涅듭옱彦붹쐿汝끺퐰O[^ 1110011011101110101101011110110010011110101011001110010111101001100101001110011010011100100111111110011010100011100001011110010010111101100110010100111101011011111001101110111010110101111011001001111010101100111001011110100110010100111001101001110010011111111001101010001110000101111001001011110110011001010011110101101101011110 e6eeb5ec9eace5e994e69c9fe6a385e4bd994f5be6eeb5ec9eace5e994e69c9fe6a385e4bd994f5b5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)