To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????v????vB 0011111100111111001111110011111101110110001111110011111100111111001111110111011001000010 3f3f3f3f763f3f3f3f7642
SJIS-WIN 宵。。v宵。。vB 100011111010101010100001101000011111001001111101011101101000111110101010101000011010000111110010011111010111011001000010 8faaa1a1f27d768faaa1a1f27d7642
EUC-JP 宵。。?v宵。。?vB 1011111010101100100011101010000110001110101000010011111101110110101111101010110010001110101000011000111010100001001111110111011001000010 beac8ea18ea13f76beac8ea18ea13f7642
UTF-8 宵。。v宵。。vB 111001011010111010110101111011111011110110100001111011111011110110100001111011101000011010110101011101101110010110101110101101011110111110111101101000011110111110111101101000011110111010000110101101010111011001000010 e5aeb5efbda1efbda1ee86b576e5aeb5efbda1efbda1ee86b57642
UHC 宵???v宵???vB 11100001101100100011111100111111001111110111011011100001101100100011111100111111001111110111011001000010 e1b23f3f3f76e1b23f3f3f7642

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)