Which bit string is a character (or short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
| Charset | Character | Bit string (binary) | Bit string (hexadecimal) |
|---|---|---|---|
| ISO-8859-1 | ????v????vB | 0011111100111111001111110011111101110110001111110011111100111111001111110111011001000010 | 3f3f3f3f763f3f3f3f7642 |
| SJIS-WIN | 逵シ蠎オv逵シ蠎オvB | 111001111001110010111100111001011011101010110101011101101110011110011100101111001110010110111010101101010111011001000010 | e79cbce5bab576e79cbce5bab57642 |
| EUC-JP | 逵シ蠎オv逵シ蠎オvB | 11101101111111001000111010111100111010101011110010001110101101010111011011101101111111001000111010111100111010101011110010001110101101010111011001000010 | edfc8ebceabc8eb576edfc8ebceabc8eb57642 |
| UTF-8 | 逵シ蠎オv逵シ蠎オvB | 111010011000000010110101111011111011110110111100111010001010000010001110111011111011110110110101011101101110100110000000101101011110111110111101101111001110100010100000100011101110111110111101101101010111011001000010 | e980b5efbdbce8a08eefbdb576e980b5efbdbce8a08eefbdb57642 |
| UHC | 逵???v逵???vB | 11010000101100000011111100111111001111110111011011010000101100000011111100111111001111110111011001000010 | d0b03f3f3f76d0b03f3f3f7642 |

SJIS-Win, EUC-JP: Classic charsets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
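
The same kind of conversion can be approximated offline. The following is a minimal Python sketch, not this page's actual implementation: the `CODECS` mapping and the `encode_bits` helper are hypothetical names, and it assumes Python's `cp932` and `cp949` codecs correspond to SJIS-Win and UHC. Characters a charset cannot represent are replaced with `?` (0x3f), similar to the `?` entries a converter of this kind shows, though the exact replacement behavior may differ between tools.

```python
# Hypothetical mapping from charset labels to Python codec names (an assumption).
CODECS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_bits(text, codec):
    """Encode text with codec and return (binary string, hex string)."""
    # errors="replace" substitutes "?" for characters the codec cannot encode.
    raw = text.encode(codec, errors="replace")
    # Render every byte as eight binary digits, most significant bit first.
    binary = "".join(f"{byte:08b}" for byte in raw)
    return binary, raw.hex()


if __name__ == "__main__":
    sample = "あvB"  # any short string works here
    for label, codec in CODECS.items():
        binary, hexadecimal = encode_bits(sample, codec)
        print(label, binary, hexadecimal)
```

Each output line gives the charset label followed by the binary and hexadecimal forms of the encoded bytes.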