To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????}v?????????}vB 0011111100111111001111110011111100111111001111110011111100111111001111110111110101110110001111110011111100111111001111110011111100111111001111110011111100111111011111010111011001000010 3f3f3f3f3f3f3f3f3f7d763f3f3f3f3f3f3f3f3f7d7642
SJIS-WIN 蜈??阿??閻??}v蜈??阿??閻??}vB 1110010110000101001111110011111110001000101000100011111100111111111010001000010100111111001111110111110101110110111001011000010100111111001111111000100010100010001111110011111111101000100001010011111100111111011111010111011001000010 e5853f3f88a23f3fe8853f3f7d76e5853f3f88a23f3fe8853f3f7d7642
EUC-JP 蜈??阿??閻??}v蜈??阿??閻??}vB 1110100111100101001111110011111110110000101001000011111100111111111011111110010100111111001111110111110101110110111010011110010100111111001111111011000010100100001111110011111111101111111001010011111100111111011111010111011001000010 e9e53f3fb0a43f3fefe53f3f7d76e9e53f3fb0a43f3fefe53f3f7d7642
UTF-8 蜈욅쥢阿숃쪥閻뺡눋}v蜈욅쥢阿숃쪥閻뺡눋}vB 1110100010011100100010001110110010011010100001011110110010100101101000101110100110011000101111111110110010001000100000111110110010101010101001011110100110010110101110111110101110111010101000011110101110001000100010110111110101110110111010001001110010001000111011001001101010000101111011001010010110100010111010011001100010111111111011001000100010000011111011001010101010100101111010011001011010111011111010111011101010100001111010111000100010001011011111010111011001000010 e89c88ec9a85eca5a2e998bfec8883ecaaa5e996bbebbaa1eb888b7d76e89c88ec9a85eca5a2e998bfec8883ecaaa5e996bbebbaa1eb888b7d7642
UHC 蜈욅쥢阿숃쪥閻뺡눋}v蜈욅쥢阿숃쪥閻뺡눋}vB 1110100010100101100111101110011110100010100101011110010010111001100110011110100010100101100111101110011110100010100101011110100110110100101011000111110101110110111010001010010110011110111001111010001010010101111001001011100110011001111010001010010110011110111001111010001010010101111010011011010010101100011111010111011001000010 e8a59ee7a295e4b999e8a59ee7a295e9b4ac7d76e8a59ee7a295e4b999e8a59ee7a295e9b4ac7d7642

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)