To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN ??????臾??筌??揖?┥怨???湲?? 00111111001111110011111100111111001111110011111111100100011010110011111100111111111000101010001100111111001111111001011101001011001111111000010010111100100010011000010100111111001111110011111110011111110100010011111100111111 3f3f3f3f3f3fe46b3f3fe2a33f3f974b3f84bc89853f3f3f9fd13f3f
EUC-JP ??????臾??筌??揖?┥怨???湲?? 00111111001111110011111100111111001111110011111111100111110011000011111100111111111001001010010100111111001111111100110110101100001111111010100010111110101100011110010100111111001111110011111111011110110100110011111100111111 3f3f3f3f3f3fe7cc3f3fe4a53f3fcdac3fa8beb1e53f3f3fded33f3f
UTF-8 嶺뚮뿪留됪룚臾뺤춷筌믨퀗揖쏙┥怨빬딂땻湲몃왂 111011111010011010101011111010111001101010101110111010111011111110101010111011111010011110001101111010111001000010101010111010111010001110011010111010001000011110111110111010111011101010100100111011001011011010110111111001111010110110001100111010111010111110101000111011011000000010010111111001101000111110010110111011001000111110011001111000101001010010100101111001101000000010101000111010111011100110101100111010111001010010000010111010111001010110111011111001101011100110110010111010111010101010000011111011001001100110000010 efa6abeb9aaeebbfaaefa78deb90aaeba39ae887beebbaa4ecb6b7e7ad8cebafa8ed8097e68f96ec8f99e294a5e680a8ebb9aceb9482eb95bbe6b9b2ebaa83ec9982
UHC 嶺뚮뿪留됪룚臾뺤춷筌믨퀗揖쏙┥怨빬딂땻湲몃왂 1110011110101101100011001110101110010111101010101110101110100111100010011110011010001111100101101110101110101100100101011110110010101101100100111110111110100111100100101110101010110011100011001110101111100111101111011110111110100110101111101110101010110011100101011100010110001010111010001000101110010001111010101011100010111000111010111001111010110101 e7ad8ceb97aaeba789e68f96ebac95ecad93efa792eab38cebe7bdefa6beeab395c58ae88b91eab8b8eb9eb5

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)