To what bit string is a character (or string) encoded in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset | Character | Bit string (binary) | Bit string (hexadecimal)
ISO-8859-1 ??????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 翁??猷??袁k?嵬??愉ユ????堊 100010011010010100111111001111111001011101010001001111110011111111100101110011011000001010001011001111111001101111001010001111110011111110010110111110011000001110000110001111110011111100111111001111111001101010111111 89a53f3f97513f3fe5cd828b3f9bca3f3f96f983863f3f3f3f9abf
EUC-JP 翁??猷??袁k?嵬??愉ユ?洧??堊 1011001010100111001111110011111111001101101100100011111100111111111010101100111110100011111010110011111111010110110011000011111100111111110011001111101110100101111001100011111110001111110001111011010000111111001111111101010011000001 b2a73f3fcdb23f3feacfa3eb3fd6cc3f3fccfba5e63f8fc7b43f3fd4c1
UTF-8 翁띾끃猷딃렟袁k츇嵬됯램愉ユ갭洧뺣뉼堊 111001111011111110000001111010111001110110111110111010111000000110000011111001111000110010110111111010111001010010000011111010111010000010011111111010001010001010000001111011111011110110001011111011001011100010000111111001011011010110101100111010111001000010101111111010111001111010101000111001101000010010001001111000111000001110100110111010101011000010101101111001101011010010100111111010111011101010100011111010111000100110111100111001011010000010001010 e7bf81eb9dbeeb8183e78cb7eb9483eba09fe8a281efbd8becb887e5b5aceb90afeb9ea8e68489e383a6eab0ade6b4a7ebbaa3eb89bce5a08a
UHC 翁띾끃猷딃렟袁k츇嵬됯램愉ユ갭洧뺣뉼堊 1110100010111010100011011110101110000101101110011110101110100011100010101110100110001110101100001110101010111110101000111110101110101110100001001110100011100011100010011110101010110111101001011110101011110000101010111110011010110000101110001110101011111011100101011110101110110100101111001110010010111110 e8ba8deb85b9eba38ae98eb0eabea3ebae84e8e389eab7a5eaf0abe6b0b8eafb95ebb4bce4be

SJIS-Win, EUC-JP: classic character sets used mainly for Japanese text, on Windows (SJIS-Win = CP932) and UNIX (EUC-JP) respectively.
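
The conversion shown in the table can be approximated offline. The following is a minimal Python sketch, not the code behind this page. The helper name `to_bit_strings` is hypothetical, and the codec names `cp932` (for SJIS-Win) and `cp949` (for UHC) are assumed to be the closest Python equivalents. The `errors="replace"` setting is also an assumption, chosen because the table shows unencodable characters as "?" (hex 3f). Results may differ slightly from the table for characters where the implementations disagree.

```python
def to_bit_strings(text, charset):
    """Encode text in the given charset and return (binary, hexadecimal) strings.

    Characters the charset cannot represent are replaced with "?" (0x3f),
    an assumption made to mirror the "?" entries in the table.
    """
    data = text.encode(charset, errors="replace")
    # Each encoded byte becomes 8 binary digits, zero-padded on the left.
    binary = "".join(f"{byte:08b}" for byte in data)
    return binary, data.hex()


# Python codec names assumed to correspond to the table's charsets.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}

for label, codec in CHARSETS.items():
    binary, hexadecimal = to_bit_strings("あ", codec)
    print(label, binary, hexadecimal)
```

For the hiragana "あ", this gives e38182 in UTF-8, 82a0 in CP932, a4a2 in EUC-JP, and 3f in ISO-8859-1, which has no Japanese characters.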