To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 翁??猷??音??齬??愉??源??娃??ザ 100010011010010100111111001111111001011101010001001111110011111110001001101110010011111100111111111010101001011100111111001111111001011011111001001111110011111110001100101110010011111100111111100010001010000100111111001111111000001101010101 89a53f3f97513f3f89b93f3fea973f3f96f93f3f8cb93f3f88a13f3f8355
EUC-JP 翁??猷??音??齬??愉??源??娃??ザ 101100101010011100111111001111111100110110110010001111110011111110110010101110110011111100111111111100111111011100111111001111111100110011111011001111110011111110111000101110110011111100111111101100001010001100111111001111111010010110110110 b2a73f3fcdb23f3fb2bb3f3ff3f73f3fccfb3f3fb8bb3f3fb0a33f3fa5b6
UTF-8 翁띾끃猷딃렟音우퍥齬잙벊愉놂쬁源녿짒娃빺우ザ 111001111011111110000001111010111001110110111110111010111000000110000011111001111000110010110111111010111001010010000011111010111010000010011111111010011001111110110011111011001001101010110000111011011000110110100101111010011011110110101100111011001001111010011001111010111011001010001010111001101000010010001001111010111000011010000010111011001010110010000001111001101011101010010000111010111000010110111111111011001010011110010010111001011010100010000011111010111011100110111010111011001001101010110000111000111000001010110110 e7bf81eb9dbeeb8183e78cb7eb9483eba09fe99fb3ec9ab0ed8da5e9bdacec9e99ebb28ae68489eb8682ecac81e6ba90eb85bfeca792e5a883ebb9baec9ab0e382b6
UHC 翁띾끃猷딃렟音우퍥齬잙벊愉놂쬁源녿짒娃빺우ザ 1110100010111010100011011110101110000101101110011110101110100011100010101110100110001110101100001110101111100101101111111110110010111011100111001110010111100001100111111110101110010011101011011110101011110000101100111110111110100110100110001110101010111001100001101110101110100011100111001110100011011111100101011100111010111111111011001010101110110110 e8ba8deb85b9eba38ae98eb0ebe5bfecbb9ce5e19feb93adeaf0b3efa698eab986eba39ce8df95cebfecabb6

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)