To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 搖??踰??受??歪?9異??濡ル?億 10011101100010100011111100111111111001101111101000111111001111111000111011110011001111110011111110011000011000110011111110000010010110001000100011011001001111110011111110010100010001111000001110001011001111111000100110101101 9d8a3f3fe6fa3f3f8ef33f3f98633f825888d93f3f9447838b3f89ad
EUC-JP 搖??踰??受??歪?9異??濡ル?億 11011001111010100011111100111111111011001111110000111111001111111011110011110101001111110011111111001111110001000011111110100011101110011011000011011011001111110011111111000111101010001010010111101011001111111011001010101111 d9ea3f3fecfc3f3fbcf53f3fcfc43fa3b9b0db3f3fc7a8a5eb3fb2af
UTF-8 搖깅쵎踰됭린受쇱젂歪묐9異룬뼸濡ル쾳億 111001101001000010010110111010101011100110000101111011001011010110001110111010001011100010110000111010111001000010101101111010111010011010110000111001011000111110010111111011001000011110110001111011001010000010000010111001101010110110101010111010111010110010010000111011111011110010011001111001111001010110110000111010111010001110101100111010111011110010111000111001101011111110100001111000111000001110101011111011001011111010110011111001011000010010000100 e69096eab985ecb58ee8b8b0eb90adeba6b0e58f97ec87b1eca082e6adaaebac90efbc99e795b0eba3acebbcb8e6bfa1e383abecbeb3e58484
UHC 搖깅쵎踰됭린受쇱젂歪묐9異룬뼸濡ル쾳億 1110100011110100101100011110101110101100100100001110101110110010100010011110100010111000101100001110000111110100101111001110110010100000100001101110100011100000100100011110101110100011101110011110110010110110101101111110100110010110101110111110101110100001101010111110101110110010100010011110010111100010 e8f4b1ebac90ebb289e8b8b0e1f4bceca086e8e091eba3b9ecb6b7e996bbeba1abebb289e5e2

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)