To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????h????????? 00111111001111110011111100111111001111110011111100111111001111110011111101101000001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f683f3f3f3f3f3f3f3f3f
SJIS-WIN 也ゅ?言??餓??h也ゅ?言??餓?? 100101101110011110000010111000110011111110001100101111100011111100111111100010011110110000111111001111110110100010010110111001111000001011100011001111111000110010111110001111110011111110001001111011000011111100111111 96e782e33f8cbe3f3f89ec3f3f6896e782e33f8cbe3f3f89ec3f3f
EUC-JP 也ゅ?言??餓??h也ゅ?言??餓?? 110011001110100110100100111001010011111110111000110000000011111100111111101100101110111000111111001111110110100011001100111010011010010011100101001111111011100011000000001111110011111110110010111011100011111100111111 cce9a4e53fb8c03f3fb2ee3f3f68cce9a4e53fb8c03f3fb2ee3f3f
UTF-8 也ゅ끀言됭갬餓뽨퓘h也ゅ끀言됭갬餓뽨퓘 11100100101110011001111111100011100000101000010111101011100000011000000011101000101010001000000011101011100100001010110111101010101100001010110011101001101001001001001111101011101111011010100011101101100100111001100001101000111001001011100110011111111000111000001010000101111010111000000110000000111010001010100010000000111010111001000010101101111010101011000010101100111010011010010010010011111010111011110110101000111011011001001110011000 e4b99fe38285eb8180e8a880eb90adeab0ace9a493ebbda8ed939868e4b99fe38285eb8180e8a880eb90adeab0ace9a493ebbda8ed9398
UHC 也ゅ끀言됭갬餓뽨퓘h也ゅ끀言됭갬餓뽨퓘 11100101101001011010101011100101100001011011011011100101111010111000100111101000101100001011011111100100101110111001011011100100101111111000001101101000111001011010010110101010111001011000010110110110111001011110101110001001111010001011000010110111111001001011101110010110111001001011111110000011 e5a5aae585b6e5eb89e8b0b7e4bb96e4bf8368e5a5aae585b6e5eb89e8b0b7e4bb96e4bf83

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)