To which bit string is a character (or a short string) encoded in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ????????????????????? 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN ?ぁ?い’?}????悠い?日お???杷い 00111111100000101001111100111111100000101010001010000001011001100011111110000001011100000011111100111111001111110011111110010111010010011000001010100010001111111001001111111010100000101010100000111111001111110011111110010100011001101000001010100010 3f829f3f82a281663f81703f3f3f3f974982a23f93fa82a83f3f3f946682a2
EUC-JP ?ぁ?い’?}????悠い?日お???杷い 00111111101001001010000100111111101001001010010010100001110001110011111110100001110100010011111100111111001111110011111111001101101010101010010010100100001111111100011011111100101001001010101000111111001111110011111111000111110001111010010010100100 3fa4a13fa4a4a1c73fa1d13f3f3f3fcdaaa4a43fc6fca4aa3f3f3fc7c7a4a4
UTF-8 룵ぁ캀い’룵}룵₃룵ㄱ悠い룫日お▩룵ㄱ杷い 111010111010001110110101111000111000000110000001111011001011101010000000111000111000000110000100111000101000000010011001111010111010001110110101111011111011110110011101111010111010001110110101111000101000001010000011111010111010001110110101111000111000010010110001111001101000001010100000111000111000000110000100111010111010001110101011111001101001011110100101111000111000000110001010111000101001011010101001111010111010001110110101111000111000010010110001111001101001110110110111111000111000000110000100 eba3b5e38181ecba80e38184e28099eba3b5efbd9deba3b5e28283eba3b5e384b1e682a0e38184eba3abe697a5e3818ae296a9eba3b5e384b1e69db7e38184
UHC 룵ぁ캀い’룵}룵₃룵ㄱ悠い룫日お▩룵ㄱ杷い 100011111010101010101010101000011010111110001111101010101010010010100001101011111000111110101010101000111111110110001111101010101010100111111101100011111010101010100100101000011110101011101101101010101010010010001111101000101110110011101101101010101010101010100010110011001000111110101010101001001010000111110111111011011010101010100100 8faaaaa1af8faaa4a1af8faaa3fd8faaa9fd8faaa4a1eaedaaa48fa2ecedaaaaa2cc8faaa4a1f7edaaa4

SJIS-Win, EUC-JP: Classic charsets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
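
The conversion shown in the table can be approximated offline with a minimal Python sketch. This is not the page's actual implementation. The names `CHARSETS`, `encode_row`, and `convert` are hypothetical, and the mapping of the table labels to Python codec names (SJIS-WIN to `cp932`, UHC to `cp949`) is an assumption. The sketch also assumes that characters a charset cannot represent are replaced with "?" (hex 3f), which is what the ISO-8859-1 row suggests.

```python
# Assumed mapping from the table's charset labels to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",   # assumption: SJIS-Win corresponds to CP932
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",        # assumption: UHC corresponds to CP949
}


def encode_row(text, codec):
    """Return (binary string, hex string) for text encoded with codec.

    Characters the codec cannot represent are replaced with "?".
    """
    data = text.encode(codec, errors="replace")
    binary = "".join(f"{byte:08b}" for byte in data)  # 8 bits per byte
    return binary, data.hex()


def convert(text):
    """Encode text in every charset and return {label: (binary, hex)}."""
    return {label: encode_row(text, codec) for label, codec in CHARSETS.items()}


if __name__ == "__main__":
    for label, (binary, hexadecimal) in convert("い").items():
        print(label, binary, hexadecimal)
```

Each byte is rendered as eight binary digits and two hexadecimal digits, so the binary column is always four times as long as the hexadecimal one.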