To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????[?????????[^ 001111110011111100111111001111110011111100111111001111110011111100111111010110110011111100111111001111110011111100111111001111110011111100111111001111110101101101011110 3f3f3f3f3f3f3f3f3f5b3f3f3f3f3f3f3f3f3f5b5e
SJIS-WIN 誤??揖??碎??[誤??揖??碎??[^ 100011001110101100111111001111111001011101001011001111110011111111100001111010100011111100111111010110111000110011101011001111110011111110010111010010110011111100111111111000011110101000111111001111110101101101011110 8ceb3f3f974b3f3fe1ea3f3f5b8ceb3f3f974b3f3fe1ea3f3f5b5e
EUC-JP 誤??揖??碎??[誤??揖??碎??[^ 101110001110110100111111001111111100110110101100001111110011111111100010111011000011111100111111010110111011100011101101001111110011111111001101101011000011111100111111111000101110110000111111001111110101101101011110 b8ed3f3fcdac3f3fe2ec3f3f5bb8ed3f3fcdac3f3fe2ec3f3f5b5e
UTF-8 誤곣뫜揖덅쫨碎쇱뎵[誤곣뫜揖덅쫨碎쇱뎵[^ 111010001010101010100100111010101011001110100011111010111010101110011100111001101000111110010110111010111000110110000101111011001010101110101000111001111010001010001110111011001000011110110001111010111000111010110101010110111110100010101010101001001110101010110011101000111110101110101011100111001110011010001111100101101110101110001101100001011110110010101011101010001110011110100010100011101110110010000111101100011110101110001110101101010101101101011110 e8aaa4eab3a3ebab9ce68f96eb8d85ecaba8e7a28eec87b1eb8eb55be8aaa4eab3a3ebab9ce68f96eb8d85ecaba8e7a28eec87b1eb8eb55b5e
UHC 誤곣뫜揖덅쫨碎쇱뎵[誤곣뫜揖덅쫨碎쇱뎵[^ 111010001010011010000001111000101001000110111100111010111110011110001000111010001010011010000001111000011110111110111100111011001000100110001000010110111110100010100110100000011110001010010001101111001110101111100111100010001110100010100110100000011110000111101111101111001110110010001001100010000101101101011110 e8a681e291bcebe788e8a681e1efbcec89885be8a681e291bcebe788e8a681e1efbcec89885b5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)