To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????????????????????^ 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN 猥k??ゆ??l??l?猥k??ゆ??l??l?^ 1110000011001110100000101000101100111111001111111000001011100100001111110011111110000010100011000011111100111111100000101000110000111111111000001100111010000010100010110011111100111111100000101110010000111111001111111000001010001100001111110011111110000010100011000011111101011110 e0ce828b3f3f82e43f3f828c3f3f828c3fe0ce828b3f3f82e43f3f828c3f3f828c3f5e
EUC-JP 猥k??ゆ??l??l?猥k??ゆ??l??l?^ 1110000011010000101000111110101100111111001111111010010011100110001111110011111110100011111011000011111100111111101000111110110000111111111000001101000010100011111010110011111100111111101001001110011000111111001111111010001111101100001111110011111110100011111011000011111101011110 e0d0a3eb3f3fa4e63f3fa3ec3f3fa3ec3fe0d0a3eb3f3fa4e63f3fa3ec3f3fa3ec3f5e
UTF-8 猥k㈃溜ゆ짂溜l뼐溜l뙣猥k㈃溜ゆ짂溜l뼐溜l뙟^ 11100111100011001010010111101111101111011000101111100011100010001000001111101111101001111000101111100011100000101000011011101100101001111000001011101111101001111000101111101111101111011000110011101011101111001001000011101111101001111000101111101111101111011000110011101011100110011010001111100111100011001010010111101111101111011000101111100011100010001000001111101111101001111000101111100011100000101000011011101100101001111000001011101111101001111000101111101111101111011000110011101011101111001001000011101111101001111000101111101111101111011000110011101011100110011001111101011110 e78ca5efbd8be38883efa78be38286eca782efa78befbd8cebbc90efa78befbd8ceb99a3e78ca5efbd8be38883efa78be38286eca782efa78befbd8cebbc90efa78befbd8ceb999f5e
UHC 猥k㈃溜ゆ짂溜l뼐溜l뙣猥k㈃溜ゆ짂溜l뼐溜l뙟^ 11101000111001011010001111101011101010011011010011101010111111101010101011100110101000111001001011101010111111101010001111101100100101101001100011101010111111101010001111101100100011001010100011101000111001011010001111101011101010011011010011101010111111101010101011100110101000111001001011101010111111101010001111101100100101101001100011101010111111101010001111101100100011001010010001011110 e8e5a3eba9b4eafeaae6a392eafea3ec9698eafea3ec8ca8e8e5a3eba9b4eafeaae6a392eafea3ec9698eafea3ec8ca45e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)