To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????N}?????????N{^ 0011111100111111001111110011111100111111001111110011111100111111001111110100111001111101001111110011111100111111001111110011111100111111001111110011111100111111010011100111101101011110 3f3f3f3f3f3f3f3f3f4e7d3f3f3f3f3f3f3f3f3f4e7b5e
SJIS-WIN 沃??毅??碎??N}沃??毅??碎??N{^ 1001011110000000001111110011111110001011010000100011111100111111111000011110101000111111001111110100111001111101100101111000000000111111001111111000101101000010001111110011111111100001111010100011111100111111010011100111101101011110 97803f3f8b423f3fe1ea3f3f4e7d97803f3f8b423f3fe1ea3f3f4e7b5e
EUC-JP 沃??毅??碎??N}沃??毅??碎??N{^ 1100110111100000001111110011111110110101101000110011111100111111111000101110110000111111001111110100111001111101110011011110000000111111001111111011010110100011001111110011111111100010111011000011111100111111010011100111101101011110 cde03f3fb5a33f3fe2ec3f3f4e7dcde03f3fb5a33f3fe2ec3f3f4e7b5e
UTF-8 沃섃뫁毅㎬첋碎ㅻ첇N}沃섃뫁毅㎬첋碎ㅻ첇N{^ 1110011010110010100000111110110010000100100000111110101110101011100000011110011010101111100001011110001110001110101011001110110010110010100010111110011110100010100011101110001110000101101110111110110010110010100001110100111001111101111001101011001010000011111011001000010010000011111010111010101110000001111001101010111110000101111000111000111010101100111011001011001010001011111001111010001010001110111000111000010110111011111011001011001010000111010011100111101101011110 e6b283ec8483ebab81e6af85e38eacecb28be7a28ee385bbecb2874e7de6b283ec8483ebab81e6af85e38eacecb28be7a28ee385bbecb2874e7b5e
UHC 沃섃뫁毅㎬첋碎ㅻ첇N}沃섃뫁毅㎬첋碎ㅻ첇N{^ 1110100010101010100110001110001010010001101001011110101111110110101001111110100010101010100110001110000111101111101001001110101110101010100101000100111001111101111010001010101010011000111000101001000110100101111010111111011010100111111010001010101010011000111000011110111110100100111010111010101010010100010011100111101101011110 e8aa98e291a5ebf6a7e8aa98e1efa4ebaa944e7de8aa98e291a5ebf6a7e8aa98e1efa4ebaa944e7b5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)