To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????d}v?????????d}vB 00111111001111110011111100111111001111110011111100111111001111110011111101100100011111010111011000111111001111110011111100111111001111110011111100111111001111110011111101100100011111010111011001000010 3f3f3f3f3f3f3f3f3f647d763f3f3f3f3f3f3f3f3f647d7642
SJIS-WIN ?レ?乳??音??d}v?レ?乳??音??d}vB 00111111100000111000110000111111100100111111101100111111001111111000100110111001001111110011111101100100011111010111011000111111100000111000110000111111100100111111101100111111001111111000100110111001001111110011111101100100011111010111011001000010 3f838c3f93fb3f3f89b93f3f647d763f838c3f93fb3f3f89b93f3f647d7642
EUC-JP ?レ?乳??音??d}v?レ?乳??音??d}vB 00111111101001011110110000111111110001101111110100111111001111111011001010111011001111110011111101100100011111010111011000111111101001011110110000111111110001101111110100111111001111111011001010111011001111110011111101100100011111010111011001000010 3fa5ec3fc6fd3f3fb2bb3f3f647d763fa5ec3fc6fd3f3fb2bb3f3f647d7642
UTF-8 曆レ눘乳썸략音깃퐷d}v曆レ눘乳썸략音깃퐷d}vB 11101111101001101000101111100011100000111010110011101011100010001001100011100100101110011011001111101100100011011011100011101011100111101011010111101001100111111011001111101010101110011000001111101101100100001011011101100100011111010111011011101111101001101000101111100011100000111010110011101011100010001001100011100100101110011011001111101100100011011011100011101011100111101011010111101001100111111011001111101010101110011000001111101101100100001011011101100100011111010111011001000010 efa68be383aceb8898e4b9b3ec8db8eb9eb5e99fb3eab983ed90b7647d76efa68be383aceb8898e4b9b3ec8db8eb9eb5e99fb3eab983ed90b7647d7642
UHC 曆レ눘乳썸략音깃퐷d}v曆レ눘乳썸략音깃퐷d}vB 11100110101101111010101111101100100001111011000111101010111000011011110111100110101101111010101111101011111001011011000111101010101111011010000001100100011111010111011011100110101101111010101111101100100001111011000111101010111000011011110111100110101101111010101111101011111001011011000111101010101111011010000001100100011111010111011001000010 e6b7abec87b1eae1bde6b7abebe5b1eabda0647d76e6b7abec87b1eae1bde6b7abebe5b1eabda0647d7642

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)