What bit string is a character (or string of characters) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ????????????????????????^ 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN 紆???紆?弔枋紆???紆?弔裴紆???紆?弔訪^ 11100010111111000011111100111111001111111110001011111100001111111001001010100010100111100110001111100010111111000011111100111111001111111110001011111100001111111001001010100010111001011110100011100010111111000011111100111111001111111110001011111100001111111001001010100010100101100100101101011110 e2fc3f3f3fe2fc3f92a29e63e2fc3f3f3fe2fc3f92a2e5e8e2fc3f3f3fe2fc3f92a2964b5e
EUC-JP 紆?檉?紆?弔枋紆?檉?紆?弔裴紆?檉?紆?弔訪^ 11100100111111100011111110001111110001011011101100111111111001001111111000111111110001001010010011011011110001001110010011111110001111111000111111000101101110110011111111100100111111100011111111000100101001001110101011101010111001001111111000111111100011111100010110111011001111111110010011111110001111111100010010100100110010111010110001011110 e4fe3f8fc5bb3fe4fe3fc4a4dbc4e4fe3f8fc5bb3fe4fe3fc4a4eaeae4fe3f8fc5bb3fe4fe3fc4a4cbac5e
UTF-8 紆렡檉렢紆렡弔枋紆렡檉렢紆렡弔裴紆렡檉렢紆렡弔訪^ 11100111101101001000011011101011101000001010000111100110101010101000100111101011101000001010001011100111101101001000011011101011101000001010000111100101101111001001010011100110100111101000101111100111101101001000011011101011101000001010000111100110101010101000100111101011101000001010001011100111101101001000011011101011101000001010000111100101101111001001010011101000101000111011010011100111101101001000011011101011101000001010000111100110101010101000100111101011101000001010001011100111101101001000011011101011101000001010000111100101101111001001010011101000101010001010101001011110 e7b486eba0a1e6aa89eba0a2e7b486eba0a1e5bc94e69e8be7b486eba0a1e6aa89eba0a2e7b486eba0a1e5bc94e8a3b4e7b486eba0a1e6aa89eba0a2e7b486eba0a1e5bc94e8a8aa5e
UHC 紆렡檉렢紆렡弔枋紆렡檉렢紆렡弔裴紆렡檉렢紆렡弔訪^ 11101001111000011000111010110010111011111110000010001110101100111110100111100001100011101011001011110000110000001101101110110011111010011110000110001110101100101110111111100000100011101011001111101001111000011000111010110010111100001100000011011011110100001110100111100001100011101011001011101111111000001000111010110011111010011110000110001110101100101111000011000000110110111011111001011110 e9e18eb2efe08eb3e9e18eb2f0c0dbb3e9e18eb2efe08eb3e9e18eb2f0c0dbd0e9e18eb2efe08eb3e9e18eb2f0c0dbbe5e

SJIS-Win, EUC-JP: Legacy charsets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
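
A table like the one above can be approximated offline. The following is a minimal sketch in Python, not the converter's actual implementation. It assumes that Python's `latin-1`, `cp932`, `euc_jp`, `utf-8`, and `cp949` codecs correspond to the ISO-8859-1, SJIS-WIN, EUC-JP, UTF-8, and UHC rows, and that unencodable characters are replaced with `?` (byte 3f), as the ISO-8859-1 row suggests. The function name `encode_table` and the `CHARSETS` list are hypothetical names introduced for illustration. Codec mapping tables can differ between implementations, so some bytes may not match the converter's output exactly.

```python
# Assumed mapping from the table's charset labels to Python codec names.
CHARSETS = [
    ("ISO-8859-1", "latin-1"),
    ("SJIS-WIN", "cp932"),
    ("EUC-JP", "euc_jp"),
    ("UTF-8", "utf-8"),
    ("UHC", "cp949"),
]


def encode_table(text, charsets=CHARSETS):
    """Return (label, binary string, hex string) for each charset.

    Characters that a charset cannot represent are replaced with '?'
    (assumption: this mirrors the 3f bytes seen in the table).
    """
    rows = []
    for label, codec in charsets:
        data = text.encode(codec, errors="replace")
        bits = "".join(f"{byte:08b}" for byte in data)  # 8 bits per byte
        rows.append((label, bits, data.hex()))
    return rows


if __name__ == "__main__":
    for label, bits, hexa in encode_table("^"):
        print(label, bits, hexa)
```

Each byte is rendered as exactly eight binary digits, so the binary column is always four times as long as the hexadecimal column.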