To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 翁??猷??音??齬??愉??湲??娃??踰 100010011010010100111111001111111001011101010001001111110011111110001001101110010011111100111111111010101001011100111111001111111001011011111001001111110011111110011111110100010011111100111111100010001010000100111111001111111110011011111010 89a53f3f97513f3f89b93f3fea973f3f96f93f3f9fd13f3f88a13f3fe6fa
EUC-JP 翁??猷??音??齬??愉??湲??娃??踰 101100101010011100111111001111111100110110110010001111110011111110110010101110110011111100111111111100111111011100111111001111111100110011111011001111110011111111011110110100110011111100111111101100001010001100111111001111111110110011111100 b2a73f3fcdb23f3fb2bb3f3ff3f73f3fccfb3f3fded33f3fb0a33f3fecfc
UTF-8 翁띾끃猷딃렟音우퍥齬잙벊愉녔를湲깃틚娃뺛뀾踰 111001111011111110000001111010111001110110111110111010111000000110000011111001111000110010110111111010111001010010000011111010111010000010011111111010011001111110110011111011001001101010110000111011011000110110100101111010011011110110101100111011001001111010011001111010111011001010001010111001101000010010001001111010111000010110010100111010111010010110111100111001101011100110110010111010101011100110000011111011011000101110011010111001011010100010000011111010111011101010011011111010111000000010111110111010001011100010110000 e7bf81eb9dbeeb8183e78cb7eb9483eba09fe99fb3ec9ab0ed8da5e9bdacec9e99ebb28ae68489eb8594eba5bce6b9b2eab983ed8b9ae5a883ebba9beb80bee8b8b0
UHC 翁띾끃猷딃렟音우퍥齬잙벊愉녔를湲깃틚娃뺛뀾踰 1110100010111010100011011110101110000101101110011110101110100011100010101110100110001110101100001110101111100101101111111110110010111011100111001110010111100001100111111110101110010011101011011110101011110000101100111110011010111000101001101110101010111000101100011110101010111010100001111110100011011111100101011110001110000101101101001110101110110010 e8ba8deb85b9eba38ae98eb0ebe5bfecbb9ce5e19feb93adeaf0b3e6b8a6eab8b1eaba87e8df95e385b4ebb2

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)