To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????v????vB 0011111100111111001111110011111101110110001111110011111100111111001111110111011001000010 3f3f3f3f763f3f3f3f7642
SJIS-WIN 閼ア謐厭v閼ア謐厭vB 1110100010000100101100011110011010001101100010010111110101110110111010001000010010110001111001101000110110001001011111010111011001000010 e884b1e68d897d76e884b1e68d897d7642
EUC-JP 閼ア謐厭v閼ア謐厭vB 11101111111001001000111010110001111010111110110110110001110111100111011011101111111001001000111010110001111010111110110110110001110111100111011001000010 efe48eb1ebedb1de76efe48eb1ebedb1de7642
UTF-8 閼ア謐厭v閼ア謐厭vB 111010011001011010111100111011111011110110110001111010001010110010010000111001011000111010101101011101101110100110010110101111001110111110111101101100011110100010101100100100001110010110001110101011010111011001000010 e996bcefbdb1e8ac90e58ead76e996bcefbdb1e8ac90e58ead7642
UHC 閼?謐厭v閼?謐厭vB 1110010011011001001111111101101011001101111001101111010001110110111001001101100100111111110110101100110111100110111101000111011001000010 e4d93fdacde6f476e4d93fdacde6f47642

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)