To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????????????????? 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN ??????諛??齬??媛??純??堊?Ⅲ 001111110011111100111111001111110011111100111111111001101000011100111111001111111110101010010111001111110011111110010101010100010011111100111111100011111000001100111111001111111001101010111111001111111000011101010110 3f3f3f3f3f3fe6873f3fea973f3f95513f3f8f833f3f9abf3f8756
EUC-JP ??????諛??齬??媛??純??堊?? 0011111100111111001111110011111100111111001111111110101111100111001111110011111111110011111101110011111100111111110010011011001000111111001111111011110111100011001111110011111111010100110000010011111100111111 3f3f3f3f3f3febe73f3ff3f73f3fc9b23f3fbde33f3fd4c13f3f
UTF-8 捻뀀뜄梨뜹츦諛곕룥齬잆굤媛뺝윜純볥깹堊앹Ⅲ 111011111010011010100100111010111000000010000000111010111001110010000100111011111010011110100010111010111001110010111001111011001011100010100110111010001010101110011011111010101011001110010101111010111010001110100101111010011011110110101100111011001001111010000110111010101011010110100100111001011010101010011011111010111011101010011101111011001001110010011100111001111011010010010100111010111011001110100101111010101011100110111001111001011010000010001010111011001001010110111001111000101000010110100010 efa6a4eb8080eb9c84efa7a2eb9cb9ecb8a6e8ab9beab395eba3a5e9bdacec9e86eab5a4e5aa9bebba9dec9c9ce7b494ebb3a5eab9b9e5a08aec95b9e285a2
UHC 捻뀀뜄梨뜹츦諛곕룥齬잆굤媛뺝윜純볥깹堊앹Ⅲ 111001101111011110110010111010111000110110001000111011001011000110110110111001011010111010011100111010111011000010110000111010111000111110011110111001011110000110011111111000111000001010001010111010101011000010010101111001011001111110011111111000101110110110010011111010111011001010100001111001001011111010011101111011001010010110110010 e6f7b2eb8d88ecb1b6e5ae9cebb0b0eb8f9ee5e19fe3828aeab095e59f9fe2ed93ebb2a1e4be9deca5b2

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)